Marius Hofert, Riidiger Frey, Alexander J. McNeil 


Princeton University Press 


The Quantitative Risk Management 
Exercise Book 


Solution manual (version of 2020-03-31) 


To Saisai 
To Catharina, Sebastian and Michaela 


To Janine, Alexander and Calliope 


Preface 


Over the past fifteen years we have taught many courses on Quantitative Risk Management 
(QRM) to students, academics, financial practitioners and actuaries based on the textbook 
Quantitative Risk Management: Concepts, Techniques and Tools (McNeil, Frey and Embrechts; 
2005, 2015). Henceforth we refer to this book as the QRM textbook and use the abbreviated 
citations MFE (2005) and MFE (2015) for the two editions. 

We have used the QRM textbook to deliver lecture courses to students at our own academic 
institutions including ETH Zurich, Heriot-Watt University, the University of Leipzig, the 
Vienna University of Economics and Business (WU), the University of Washington (UW), the 
National University of Singapore (NUS) and the University of Waterloo. We are also aware 
that many colleagues have used the QRM textbook to teach courses at universities around the 
world. 

The QRM textbook has also formed the basis for numerous intensive training courses, 
workshops and summer schools which have been delivered to financial practitioners and 
actuaries. In particular, it has been used as part of the CERA (Chartered Enterprise Risk 
Actuary) education programme of the European Actuarial Academy (EAA). 

Throughout this time, we have been conscious that the QRM textbook has lacked a formal 
exercise collection to assist instructors and to provide students with the opportunity to test, 
consolidate and extend their knowledge. The QRM Exercise Book aims to fill this gap. It 
contains a comprehensive selection of the exercises that we have amassed during our own 
teaching experience, as well as a few exercises donated by colleagues. 

The structure of the exercise book follows Chapters 1-11 of MFE (2015). Within chapters, 
exercises are typically grouped into three categories: review, basic and advanced questions. 
Review questions are designed to help readers to test their understanding of the concepts 
presented in the corresponding chapter of MFE (2015). As such, the solutions are usually 
found in MFE (2015) or involve at most simple calculations based on material in MFE (2015). 

Basic questions are designed to test and enhance the understanding of the technical material. 
Typical examples might require students to calculate specific examples based on general 
techniques or might ask for simple proofs and minor extensions of results in MFE (2015). Taken 
together, review and basic questions are intended to be at the level of examination questions 
and should not require particularly long answers. Advanced exercises are either more lengthy, 
more difficult or take the reader somewhat beyond the textbook material. 

Occasionally, a programming exercise is included. We recommend that these are tackled 
using the open source R language and environment for statistical computing. To facilitate the 
solution of programming exercises we provide references to publicly available R datasets and 
packages in the questions. 

In view of the more qualitative and discursive nature of Chapter 1 of the QRM textbook, 
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the exercises and solutions are structured in a slightly different way. Case study and discussion 
questions take the place of basic and advanced questions respectively. Case study questions 
invite students to research a number of well-known company failures, market crashes and 
natural catastrophes and consider their causes and/or their financial and economic implications. 
Discussion questions are more open-ended questions that address, for example, current trends 
and controversies in risk management. 

The primary resource for solving these exercises is MFE (2015) although the exercise book 
also contains references to other books and papers that take the reader further in certain areas. 
An updated book of solutions to the exercises and a set of R scripts solving the programming 
exercises are available on the QRM Tutorial website qrmtutorial.org. For instructors there 
are some additional resources on the website including slides and extensive demonstration 
scripts in R. 

We are grateful to Paul Embrechts who initiated the QRM project during his time as Professor 
of Insurance Mathematics at ETH Zurich. We have all benefited from his inspiration and 
mentoring over the years. We would also like to thank the Forschungsinstitut für Mathematik 
(FIM) at ETH Zurich for its financial support during multiple visits by the authors. Thanks are 
also due to several doctoral and postdoctoral teaching assistants in Zurich who helped develop 
the stock of examples and exercises for teaching QRM, including Valeria Bignozzi, Bikramjit 
Das, Catherine Donnelly, Edgars Jakobsons, Georg Mainik and Johanna G. Neslehova, as well 
as to colleagues, friends and students around the world who have contributed exercises and 
solutions, in particular, Andrew Cairns and Jialing Han. 
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1 Risk in Perspective 


Review 


Exercise 1.1 (Types of financial risk) 
Give definitions and examples for each of the following types of risk in the financial context: 


a) market risk, 
) credit risk, 


c) operational risk, 


c 


a 


) model risk, and 


e) systemic risk. 


Exercise 1.2 (Financial crisis of 2007—9) 
Give an outline of the financial crisis of 2007-9 describing the main events and reasons for the 
crisis. 


Exercise 1.3 (Bank run) 
Explain what is meant by a bank run and give examples from history. 


Exercise 1.4 (Regulatory frameworks) 
a) Explain how the three-pillar concept works in the Basel and Solvency II regulatory frame- 
works. 


b) Compare and contrast the main features of the Basel and Solvency II frameworks. 


Exercise 1.5 (Procyclicality) 
Explain the meaning of procyclicality, particularly when used in relation to regulatory capital 
requirements. 


Exercise 1.6 (Basel III) 
Summarize the main changes to the Basel framework that are adopted in Basel III and explain 
how these address shortcomings of the previous Basel II Accord. 


Exercise 1.7 (Capital adequacy, leverage and liquidity coverage ratios) 
Explain the essential differences between capital adequacy ratios, leverage ratios and liquidity 
coverage ratios as applied in the Basel III accord. 
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1 Risk in Perspective 


Case Studies 


Exercise 1.8 (Major failures) 
Select from the following list of well-known failures of financial risk management. In each case, 
summarize what happened and identify the risks that the company or organisation failed to 


a) Barings Bank 

b) Metallgesellschaft (MG) 

c) Sumitomo 

d) Société Générale 

e) Orange County 

f) Long-Term Capital Management (LTCM) 
g) American International Group (AIG) 

h) Amaranth Advisors 


Exercise 1.9 (Market crashes) 
Investigate what happened during the following well-known market crashes and describe any 
changes that have resulted from these events, or any lessons that risk managers should draw. 


a) The Wall Street Crash of 1929. 
b) The ‘Black Monday’ event of 19 October 1987. 
c) The ‘Flash Crash’ of 6 May 2010. 


Exercise 1.10 (Natural catastrophes) 
Research the following natural catastrophes and describe their economic, financial and insurance 
consequences. 


a) The Kobe Earthquake (1995). 
b) Hurricane Katrina (2005). 
c) The Thailand Floods (2011). 


Discussion 


Exercise 1.11 (Risk and uncertainty) 
Attempt to formulate your own succinct definition of risk. What distinguishes risk from 
uncertainty in your opinion? 


Exercise 1.12 (The Q in QRM) 
Discuss whether quantitative methodology should play a larger or a more reduced role in risk 
management systems in the future. 


Exercise 1.13 (Trends in financial regulation) 
What are the current trends in financial regulation? In particular: 
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Discussion 


a) Has risk regulation become too complex? 
b) Is there evidence of a movement away from internal models back to simpler standardized 
approaches? 


Exercise 1.14 (Regulation and credit provision) Sol. p. 107 
An often-heard criticism of Basel III by banking practitioners is that “increased regulation 
strangles credit provision”. Discuss the arguments for and against this proposition. 


Exercise 1.15 (Shadow banking and insurance) Sol. p. 107 
a) What is meant by the shadow banking industry? 


b) Is there also a shadow insurance industry? 
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2 Basic Concepts in Risk Management 


Review 


Exercise 2.1 (Notions of capital) 

In each of the following situations a certain notion of capital discussed in MFE (2015, Sec- 
tion 2.1.3) is most relevant for the decision maker. Explain which notion of capital that 
is. 


a) A financial analyst who uses balance-sheet data to value a firm. 


b) A chief risk officer of an insurance company who has to decide on the appropriate level of 
reinsurance. 


c) A regulator who has to decide on shutting down a bank with many bad loans on its book. 


Exercise 2.2 (Different notions of financial distress) 
a) Briefly explain the difference between illiquidity, insolvency, default and bankruptcy. 
b) Describe a scenario where a financial company is insolvent but has not defaulted. 


c) Describe a scenario where a financial company has defaulted but is not insolvent. 


Exercise 2.3 (Valuation of a real-estate investment) 

Suppose the manager of a real-estate fund has to value a particular flat in Zurich, Switzerland. 

Relate the following three methods for the valuation of the flat to the valuation methods 

discussed in MFE (2015, Section 2.2.2). 

a) The manager takes the purchase price of the flat from several years ago and reduces it by 
an annual depreciation of 1% to allow for wear and tear. 

b) The manager finds transaction prices for similar flats in the neighborhood and uses them to 
compute a price per square metre. She then multiplies the square-metre price by the size of 
the flat. 

c) The manager estimates prices per square meter from a broad Swiss property price index 
and makes an ad hoc adjustment of 20% to account for the location of the flat in Zurich. 


Exercise 2.4 (Translation invariance of risk measures) 
Show that a translation-invariant risk measure can be interpreted as the amount of capital 


that needs to be added to a position so that it becomes acceptable to a regulator. 


Exercise 2.5 (Subadditivity of risk measures) 
Explain why subadditivity is often considered a desirable property of a risk measure. 
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2 Basic Concepts in Risk Management 


Exercise 2.6 (VaR and expected shortfall) 
a) Give mathematically precise definitions of value-at-risk VaR4(L) and expected shortfall 
ESa(L) for a random loss L at confidence level a € (0, 1). 


b) Explain the relative advantages of each risk measure over the other. 


Exercise 2.7 (Superadditivity scenarios for VaR) 
Describe some models for financial losses that can lead to situations where VaR is superadditive. 


Exercise 2.8 (Additivity for two linearly dependent random variables) 
Consider an arbitrary random variable X and let Y = aX + b for constants a > 0 and bE R. 
Show that VaR4(.X + Y) = VaRa(X) + VaR.(Y) for a € (0, 1). 


Basic 


Exercise 2.9 (Risk-neutral valuation for interest-rate derivatives) 

Consider a two-period model. Denote by r+, t € {0,1}, the simple interest rate from t to t +1, 
so that 1 monetary unit invested at t is worth 1 + r; at t+ 1. Assume that rg is 1.5% and 
that rj takes the values 1% and 2% with probability 1/2. Denote by p(t, T) the price at t of a 
zero-coupon bond with maturity T and face value 1. 


a) Write down p(0, 1) and p(1,2) for the cases rı = 0.01 and rı = 0.02. 

b) Suppose a long zero-coupon bond with maturity T' — 2 and face value 1 is traded for 
0.969729 at t = 0. In this setup an equivalent martingale measure Q is characterized by the 
probability q = Q(r; = 0.01). Compute q from p(0, 2). 

c) Apply risk-neutral valuation to price a stylized floor contract which pays an amount of 1 if 
T] « TQ. 

Note. In general, a floor contract is an option which provides protection against low interest 
rates. 


Exercise 2.10 (Mapping of a stock portfolio affected by exchange rates) 

Consider a portfolio P consisting of two stocks 5,1, St,2, where S; denotes the value of stock 1 

in EUR and S$; denotes the value of stock 2 in CHF. Let ef #" denote the CHF/EUR exchange 

rate at time t. In other words, 1 CHF is worth ECHE EUR at t. Furthermore, denote by A1 and 

A2 the number of shares in stocks 1 and 2 in P, respectively. 

a) Derive the value V, in EUR of P at time t in terms of the risk factors Zi; = log Stj, 
j € {1,2}, and Z,3 = log e@4*. What is the corresponding mapping? 

b) Derive the value Vii; of P at time t + 1 and the one-period loss L441. 

c) Derive the linearized one-period loss ho and express it in terms of portfolio weights w1, w2 
(the values of each stock investment relative to the value V; of the overall portfolio). 
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Basic 


Exercise 2.11 (Properties of VaR and expected shortfall) 


a) Show that VaR is a monotone, translation invariant and positive-homogeneous risk measure. 


b) Why can we conclude that ES, also satisfies these properties? 


Exercise 2.12 (VaR and ES for continuous distributions with finite mean) 
Compute VaR4(L) and ES4(L) in the following cases: 


a) L has an exponential distribution with rate parameter A > 0, that is L ~ Exp(A). 
b) L has a log-normal distribution, that is log L ^ N(p, 0°). 


c) L has the Weibull distribution with distribution function F(x) = 1 — exp(—/z/10), x > 0. 


d) L has a Pareto distribution Pa(0, 1) with distribution function F(z) = 1 — (14- z)-?, x > 0, 
0 > 0. Under what condition on 0 does ESQ(L) exist? 


Exercise 2.13 (VaR and ES for a distribution function with jumps) 
Suppose that the loss L has distribution function 


0, x<l, 
F(z) = 41-1/(1+2), z€[L3) (E1) 
1 — 1/z?, qo. 


a) Plot the graph of F. 
b) Compute value-at-risk VaRa(L) at confidence levels a = 85% and a = 95%. 
c) Compute expected shortfall ES,(Z) at confidence level a = 85%. 


Exercise 2.14 (VaR for a binomial model of a stock price) 
Consider a portfolio consisting of a single stock with current value 5, = 100. Each year, the 


stock price either increases by 4% with probability 0.8 or decreases by 4% with probability 0.2. 


Compute VaR for a € {0.7,0.95, 0.96, 0.99} over a time horizon of two years. 


Exercise 2.15 (VaR and ES for a discrete distribution) 
The following table contains the net profits on two lines of business A and B of a company 
XYZ: 


Net profit 
Outcome Probability Line A Line B 
wI 0.82 1000 1000 
we 0.04 1000 0 
03 0.04 0 1000 
w4 0.02 0 0 
Ws 0.04 1000 —10000 
w6 0.04 —10 000 1000 


a) Calculate VaRo.95 and ESo.95 for each of the business lines A and B. 
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2 Basic Concepts in Risk Management 


b) Calculate VaRo.95 and ESo.95 for the combined profits of A and B. 


How do your answers fit in with your knowledge of the coherence of the value-at-risk and 
expected shortfall risk measures? 


Exercise 2.16 (Expected shortfall for t distributions) 
Compute ES, for a standard t distribution with v degrees of freedom, L ~ ty. Give a condition 
that guarantees that ES,(L) exists. 


Exercise 2.17 (VaR and ES for bivariate normal risks) 

Consider two stocks whose log-returns are bivariate normally distributed with annualized 
volatilities c4 = 0.2, o2 = 0.25 and correlation p = 0.4. Assume that the expected returns are 
equal to 0 and that one year consists of 250 trading days. Consider a portfolio with current 
value V, = 10° (in EUR) and portfolio weights w; = 0.7 and wz = 0.3. Furthermore, denote 
by Ta the linearized loss (using log-prices of the stocks as risk factors). Compute the daily 
VaRo.99( LÀ 4) and ESo.99(L 1) for the portfolio. How does the answer change if p = 0.6? 


Exercise 2.18 (Basic historical simulation) 

Consider two stocks, indexed by j € {1,2}, with current values S;; = 1000 and $,5 = 100 in 
some unit of currency. The monthly log-returns of both stocks over the last 10 months are 
given in the following table (values in 96): 


Lag k 10 9 8 7 6 5 4 3 2 1 


Log-return of Sı at lag k —16.1 5.1 0.4 2.5 4 105 5.2 —29 19.1 0.4 
Log-return of So at lag k | —8.2 3.1 0.4 -15 -3 45 2 —3.7 10.9 —0.4 


Table E.2.1 Monthly log-returns of the two stocks over the last 10 months (in 96). 


Use historical simulation to estimate the one-month VaR at confidence level o — 0.9 for the 
linearized loss L^ for the following two portfolios: 
a) A portfolio consisting of two shares of the first stock $4. 


b) A portfolio consisting of one share of the first stock and 10 shares of the second stock. 


Exercise 2.19 (Axioms of coherence for standard deviation) 

Consider the risk measure o(L) = vvar L on the space M = L?(0,F,P) of random vari- 
ables with finite second moment. For each axiom of coherence, either prove it or provide a 
counterexample to show it does not hold. 


Exercise 2.20 (Superadditivity of VaR for two iid Bernoulli random variables) 

Let Li, Lə be independent and identically distributed (iid) Bernoulli random variables such 
that P(L; = 0) = 1 — p and P(L; = 1) =p, j € {1,2}, for some p € (0,1). For which a € (0,1) 
is VaR superadditive, that is VAR4(L1 + L2) > VaRa(L1) + VaRa(L2)? 
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Basic 


Exercise 2.21 (VaR under strictly increasing transformations of a loss) 
Let L be a random loss and h : R — R a continuous and strictly increasing function. Show that 


VaRa(h(L)) = h(VaRa(L)), o € (0,1). 


Exercise 2.22 (Subadditivity of VaR for bivariate normal random variables) 

Let (L1, L2) have a bivariate normal distribution with u; = E(L;), o? = var(L;), j € {1,2}, 

and p = corr( L1, Lə). 

a) Compute VaRq(L;), j € {1,2}, and VaRa(Lı + Lə). 

b) Show that VaR is subadditive if and only if a € [1/2, 1). 

c) Plot the graphs of a — VaRa(Lı + L3) for standard normal Lı, Lə with correlation values 
p € (—1, —0.5,0,0.5, 1). Comment on the monotonicity behaviour with respect to p. 


Exercise 2.23 (Shortfall-to-quantile ratio for exponential, normal and Pareto) 
Calculate the ratio 


ESq(L) 


[ip D ee 
a>1— VaRa(L) 
in the following cases. Comment on the differences from a risk management perspective. 
a) L ~ Exp(A) with distribution function F(x) = 1 — exp(—Az), x > 0, A > 0 (a light-tailed 
distribution). 
b) L ~ N(0, 1) (also light-tailed). 
c) L~ Pa(0,1) with distribution function F(x) = 1 — (1 + z)-?, x > 0, 0 > 1 (a heavy-tailed 
distribution). 


Exercise 2.24 (Matching VaR and ES) 

Under the Basel III guidelines, the risk measure underlying the capital requirements for market 

risk has changed from VaR to expected shortfall; see BIS (2016a) and BIS (2017). In order that 

the overall capital requirement remains constant, the confidence level of ES can be adjusted. 
Use numerical root finding to determine the value of a such that ESS(L) = VaRo,.99(L) in 

the following cases. 


a) L~ N(u, o). 
Hint. First show that it is sufficient to consider L ~ N(0, 1). 
b) L c t35. 


Exercise 2.25 (Matching VaR and ES for Pareto distributions) 


Let 6 € (0,1) and L ~ Pa(0,1) with distribution function F(x) = 1— (1+2), 2 20,0 » 1. 


a) Determine analytically the confidence level o € (0, 5] such that ES,(L) = VaRg(L) and 
give the range of 8 values for which the equation has a solution. 


b) Suppose that 8 is such that the equation has a solution. Show that a is given by an 
increasing function of 0 and explain why this is the case. 
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Advanced 


Exercise 2.26 (Superadditivity of VaR for two iid exponential random variables) 
For j € {1,2} let Fj denote the distribution function of L; and let Fr, 41, denote the distribution 
function of Ly + Lo. 


a) Show that superadditivity of VaRq is equivalent to 
a > Fa uus (Fi; (0) + Fi;(0)). 


b) Now assume that Lı and Lə are iid exponentially distributed with distribution function 
F(x) = 1 —exp(-Az), x 20, A > 0. Show that VaR, is superadditive if and only if 
(1— a)(1 — 21og(1 — o)) > 1. 


c) Determine the set of œ values for which VaR, is superadditive using numerical root finding. 


Exercise 2.27 (Superadditivity of VaR for iid Bernoulli random variables) 
For d > 2 let Y;,..., Yq be iid Bernoulli risks such that Y; ~ B(1, p) for p € [0,1]. Show that 
VaR, is superadditive if and only if (1 — p)? < a € 1 — p. 


Exercise 2.28 (Superadditivity of VaR for a heavy tailed distribution) 

Let Lı and L3 be independent random variables with distribution function F(x) = 1 — z-1/2, 
€ [1, oo). This is a form of Pareto distribution with infinite mean. Show that VaRa(L1-- L2) > 

VaRa(L4) + VaRo(L2) for all a € (0, 1). 


Hint. Show that the distribution function Fr,,r, of Li + La is given by Fr +(x) = 1— 


2/z — 1/x, zr > 2. 


Exercise 2.29 (Median shortfall) 
Let the loss L have continuous distribution function Fr. The risk measure median shortfall at 
confidence level a € [0,1) is given by 


MSa(L) = Fiir van.) 0/2); 


where Fr|r.van,(r) denotes the conditional distribution of L given that L exceeds VaR4(L). 
a) Express Frjr.van,(r) in terms of Fy. 
b) Show that MS4(L) = VaR ı+a (L) and interpret this result. 

2 


c) Let Fr(zx) = 1 — (14+ z)79, x > 0, 0 > 1, be the distribution function of the Pareto 
distribution. Compute 


lim ESo(L) 
oa1— MS,(L) 


and interpret this result from a risk management perspective. 
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Advanced 


Exercise 2.30 (Shortfall-to-quantile ratio for t) Sol. p. 126 
Let L ~ t,(u,0?) for v > 1, u€ R, o > 0. Compute VaRa(L) and ES,(L). Show that 


ESQ(L) v 
acus VaRe(h) ^ uox 


and interpret this result from a risk management perspective. 
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3 Empirical Properties of Financial Data 


Review 


Exercise 3.1 (Stylized facts) 
Give a list of univariate stylized facts for financial time series. 


Exercise 3.2 (Negative log-returns and the correlogram) 

Figure E.3.1 shows the negative log-returns of the S&P 500 index from 3 January 2011 to 31 
December 2015 (left) and the corresponding correlogram or sample autocorrelation function 
(ACF) plot (right). 
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Figure E.3.1 Negative log-returns of the S&P 500 index from 3 January 2011 to 31 December 
2015 (left) and the corresponding correlogram (right). 


a) Explain how the correlogram in Figure E.3.1 should be interpreted. 
b) Which stylized fact(s) do the two plots in Figure E.3.1 support? 


c) Explain why it is not sufficient to test whether a dataset satisfies the iid hypothesis by 
looking at the correlogram of the dataset alone. What else should you do? 
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Exercise 3.3 (Interpreting Q-Q plots) 

Figure E.3.2 shows a Q-Q plot of the negative 34 o 

log-returns of the price of one Bitcoin in USD 

from 29 May 2014 to 29 May 2018 against a 

fitted normal distribution. 

a) What does this Q-Q plot suggest about 
the distribution of Bitcoin log-returns? 


o 


0.0 


b) What numerical test might you carry out 
to support your answer to part a)? 


c) Although the QQ-plot is against a fit- 
ted normal distribution, the ordered data i 
could also have been plotted against the o 
quantiles of a standard normal. Explain | | ! ' T 

-0.10 -0.05 0.00 0.05 0.10 
why this is the case. 


Sample quantiles of -log-returns of Bitcoin 


Location: -0.0017, scale: 0.0395, sample size: 1460 


Quantiles of the fitted normal distribution 


Figure E.3.2 Q-Q plot of Bitcoin returns 
against a normal distribution. 


Exercise 3.4 (Multivariate stylized facts) 
Give a list of multivariate stylized facts for financial time series. 


Exercise 3.5 (Joint negative log-returns and the cross-correlogram) 
Figure E.3.3 shows joint negative log-returns of the S&P 500 and the DAX index from 3 
January to 31 December 2015 (left) and the corresponding cross-correlogram (right). 


a) Based on your knowledge of the correlogram, how do you think the cross-correlogram in 
Figure E.3.3 should be interpreted? 


b) Which stylized fact(s) do the two plots in Figure E.3.3 support? 


c) How would you expect the cross-correlogram of the squared negative log-returns to look? 


Exercise 3.6 (Longer-interval log-returns) 


Let S:,...,5:4, denote the daily values of an asset over a period of h > 1 days and 
X+41,---,Xt+p the corresponding daily log-returns. 
a) Give a formula for the h-day log-return 20 in terms of X441,..., Xt+n for h > 1. 


b) Explain why x may be approximately normally distributed for large h (corresponding 
to a quarter or year) even if the daily returns are not. 


Basic 


Exercise 3.7 (Manipulating logarithmic and relative returns) 
For an asset price with daily values given by the time series (S4) the h-day log-return is defined 
by xm = log(St+n/ St) and the h-day relative return by ed = (Stn — St) /S; for all h > 1. 
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Figure E.3.3 Joint negative log-returns of the S&P 500 and DAX index from 3 January 2011 
to 31 December 2015 (left) and the corresponding cross-correlogram (right). 


a) Derive a formula for the h-day log-return X in terms of the h-day relative return y. 

b) Derive a formula for the h-day relative return y in terms of the h-day log-return x. 

c) Given a series of h-day log-returns xe. ae ... and an initial stock price $;, explain 
how you would reconstruct the series of prices St+h, St+2h,.-. at h-day intervals. 

d) Given a series of h-day relative returns Y. VAN ... and an initial stock price $;, explain 


how you would reconstruct the series of prices St+h, 91125,... at h-day intervals. 


Exercise 3.8 (Experimenting with Q-Q plots) 

Use statistical software to generate 1000 data points from the following distributions; the 
quantity of data corresponds approximately to four years of daily return data. In each case 
form Q-Q plots against the standard normal distribution and interpret the results. 

a) A Student ¢ distribution with 6 degrees of freedom. 


b) A standard uniform distribution. 


c) A standard logistic distribution (with distribution function F(x) = 1/(1+exp(—2)), x € R). 


d) A standard lognormal distribution (that is the distribution of a random variable whose 
logarithm follows a standard normal distribution). 


Exercise 3.9 (Ljung-Box test) 
Repeat the analysis of MFE (2015, Table 3.1) for the stocks that comprise the Euro Stoxx 50 
index. Specifically you should: 


a) Form daily and monthly log-returns for each time series from 2000 to 2015, omitting all 


23 


Sol. p. 133 


Sol. p. 133 


Sol. p. 133 


Sol. p. 133 


Sol. p. 134 


3 Empirical Properties of Financial Data 


stocks with missing data on the first trading day in 2000 and filling the remaining missing 
data by interpolation. 


b) Carry out Ljung-Box tests on both the raw and absolute returns to address the null 
hypothesis that stock returns are iid. 


The data can be found in the dataset EURSTX, const of the R package qrmdata. The function 
na.fillO of the R package zoo might be helpful. 


Exercise 3.10 (Jarque-Bera test) 
Repeat the analysis of MFE (2015, Table 3.2) for the stocks that comprise the Euro Stoxx 50 
index using the same data as in Exercise 3.9. Specifically you should: 


a) Form weekly, monthly and quarterly log-returns. 


b) Carry out Jarque-Bera tests of normality for each stock and each time step. 


Exercise 3.11 (Skewness, kurtosis and higher moments of standard normal) 


a) Let Z be a random variable with a density f that is symmetric around 0. Show that if the 
kth moment exists it is given by 


g(Z) = 2 foo z* f(z) dz, kis even, 
0, k is odd. 


b) Let Z ~ N(0,1). By writing {5° z*¢(z) dz in terms of the gamma function, derive the 
general formula 


sz = (IU (23-1), Eis even, 
0, k is odd. 


Hint. You may use the fact that T(x + 1) = Tr (x), x > 0, for the gamma function and the 
identity [(1/2) = m. 

c) Let X be a random variable with a finite kth moment. The kth standardized moment of X 
is defined by E((X — u)^)/o^, where u = E(X) and o = \/var(X) denote the mean and 
standard deviation of X. The third and fourth standardized moments of X are known as the 
skewness and the kurtosis x. Compute the skewness, kurtosis and the 6th standardized 
moment of a normal random variable X ~ N(u, 07). 


Exercise 3.12 (Changing correlation over time) 
Repeat the analysis of MFE (2015, Figure 3.7) for the S&P 500 and DAX log-returns from 3 
January 2011 to 31 December 2015 (the data used in Exercise 3.5). Specifically you should: 


a) Estimate correlations using a rolling 25-day window and plot the time series of estimated 
correlations. 


b) Experiment with different window sizes and investigate how the plot changes. 
The data can be found in the datasets SP500 and DAX of the R package qrmdata. 
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Exercise 3.13 (A simple version of the runs test) 
In this question we study a simple version of a test for iid data known as the runs test. Let 
X1,..., Xn be return data from a continuous distribution that is approximately symmetric 
around zero. Let Y, = sign( X+) denote the sign of X+. Throughout this question we will assume 
that P(Y; = —1) = P(Y; = 1) = 1/2. 

The runs test is based on the idea of counting the number of runs, Rn, of + values and — 
values in Y1,...,Y,. For example if the whole sequence is + 4 | + + then the 


number of runs is 5. 


a) Explain why the number Rp of runs is given by 
n 
ly, 
-(Y; — Yi 
£e) 


b) Show that $75.» iQ — Y; 41)? has a binomial distribution under the null hypothesis that 
X1,..., Xn are iid. Determine the parameters of the binomial distribution. 

c) Using the normal approximation to the binomial distribution, explain how you could 
implement a test of the null hypothesis of independence based on the statistics Ry. 

d) Suppose we find no evidence against the iid hypothesis for the return data X4,..., Xn. How 
could the runs test be adapted to test for clustering of large values in the squared data? 

Note. The standard Wald-Wolfowitz runs test does not assume that P(Y; = —1) = P(Y; 


1) = 1/2 and uses a slightly different procedure to the one described above; see, for example, 
Bradley (1968, Chapter VIII). 


Exercise 3.14 (Sensitivity of correlation estimates to volatility) 

This exercise is a continuation of Exercise 3.12. Repeat the analysis of MFE (2015, Figure 3.8) 

for the S&P 500 and DAX log-returns from 3 January 2011 to 31 December 2015. Specifically 

you should: 

a) Carry out a regression analysis to investigate whether correlation estimates calculated for 
non-overlapping 25-day blocks of observations tend to be higher in periods with higher 
volatility estimates. 


b) Repeat the analysis using rolling estimates of correlation and volatility. 


Exercise 3.15 (Geometric spacings between largest values in iid samples) 

This exercise explains the theory underlying MFE (2015, Figure 3.3). Let X1, X2,... bea 
sequence of iid random variables from a continuous distribution function F and that p = 
P(X, > u) = F(u) € (0,1) for some threshold u. Let To = 0 and T; = min{i > Tj-1 : X; > u}, 
JEN. 

a) Show that the spacings S; = T; — Tj 4, j € N, of exceedances of the X;'s over the threshold 


u form a series of iid random variables following a geometric distribution with parameter p. 


b) Explain how the insight in a) can be used to implement a graphical test for volatility 
clustering? 
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c) Show that the geometric distribution can be approximated by an exponential distribution 
with parameter p if p is small. 


d) Use computer software to investigate the quality of the exponential approximation for 
p € {0.01, 0.02, 0.04}. 


Exercise 3.16 (Log-returns implied by geometric Brownian motion) 

The geometric Brownian motion is a widely used model for stock price processes in mathematical 
finance. According to this model, the stock price process (S;):>0 satisfies $, = So exp((u — 
c?/2)t + oW) for u € R, c? > 0, where the stochastic process (W;)150 satisfies: 

1) Wo = 0 almost surely and t +> W; is continuous almost surely; 


2) For any n € N and 0 € to < tı <--- < tn, the increments Wi, — Wios ..., Wy, — Wa, , are 
independent; and 

3) For all 0 € s < t, W; — W, ~ N(0,t — s). 

Answer the following questions: 


a) What model does this assumption imply for the distribution of the h-period risk-factor 


change X ” = 


time series? 


— log(St+n/ St)? Is this model supported by stylized facts about financial 


b) Show that the two risk-factor changes x and X ae yn for h, h/ > 0 are independent and 


comment on whether this fact is supported by empirical properties of financial time series? 


26 


4 Financial Time Series 


Review 


Exercise 4.1 (Stationarity) 


a) State the definitions of covariance (or weak) stationarity and strict stationarity for a time 
series ( Xi)iez. 

b) Give an example of a covariance-stationary process that is not strictly stationary and vice 
versa. 


c) Explain why stationarity is such an important assumption in time series analysis. 


d) If you had time series data displaying a linear trend, explain how you might pre-process 
them to make them suitable for modelling with a stationary time series process. 


Exercise 4.2 (Identification of ARMA processes) 
Figure E.4.1 shows simulated paths for various time series processes as well as the corresponding 


sample autocorrelation function (ACF) and partial autocorrelation function (PACF) plots. 


In each case the innovations (€¢)zez are from a strict white noise process and the time series 
(X4)iez follows one of the following models: 


a) Xi = e —0.3e¢-1 — 046i 23, t € Z 

b) X; = 10+ 20t + 2tsin(t) +e, t € Z 

c) X4,—0.3X4,1—0.4X, 2— 6€, t EZ 

d) Xi = e&t —0.Te t€ Z 

e) X,—0.3X¢_-1 = &¢ —0.66 , t€ Z 

Match each model with the pictures, explaining your reasoning. 


Note. The PACF is mentioned briefly in MFE (2015, p. 108). The PACF of an AR(p) process 
cuts off after lag p and decays exponentially for an MA process. Note that this is the 
opposite behaviour to that of the ACF, which cuts off after lag q for an MA(q) process 
and decays exponentially for an AR process. Neither the ACF nor the PACF give clear 
guidance for indentifying the order of an ARMA(p, q) process with p > 0 and q > 0. See, for 
example, Brockwell and Davis (2002). 


Exercise 4.3 (GARCH models and stylized facts) 
Consider a GARCH(1, 1) process with parameters o = 2, o4 = 0.1 and f, = 0.85. 


a) Discuss whether or not this process can replicate all of the univariate stylized facts of 
financial time series listed in MFE (2015, Section 3.1). 
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Figure E.4.1 Paths of five time series (X;)iez (left) with corresponding ACF (middle) and 
PACF (right). 
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b) Calculate the variance of the GARCH process. 


c) Explain the form that the autocorrelation function of the squared GARCH process would 
take. 


Exercise 4.4 (White noise) 


a) Explain the difference between a white noise process (WN) and a strict white noise process 
(SWN). 
b) Give an example of a WN that is not SWN and a SWN that is not WN. 


Exercise 4.5 (AR models with ARCH errors) 


a) Write down the full set of equations for a time series (X;);ez following an AR(1) model 
with non-zero mean and ARCH(1) errors. 


b) Give a formula for value-at-risk calculated at time t, that is for the conditional quantile of 


X44 in terms of previous values of the process and quantiles of the innovation distribution. 


Exercise 4.6 (QML estimation of GARCH models) 

Explain how the quasi-maximum likelihood (QML) method for estimating the parameters of a 
GARCH process works and contrast the properties of the resulting QML estimates with those 
of full maximum likelihood estimates. 


Basic 


Exercise 4.7 (Checking stationarity) 
Decide whether the following time series (X+) are covariance stationary. Calculate the mean 
function and autocorrelation function of any time series that is covariance stationary. 


if t is odd, 

a) A= i : a 7 where (&:)iez is a WN(0, o2) process. 
Et41 if t is even, 

b) X, = h_o ex for t € No, where (ei)tex, is a WN(0, o2) process. 


c) X= sin( 5t) Z2 + cos( ft) Zi, t € Z, where Zı and Z2 are independent random variables 


with mean 0 and variance oe: 


Exercise 4.8 (Symmetry of ACF) 
Show that the autocorrelation function p of a covariance-stationary time series is an even 
function, that is p(—h) = p(h), h € Z. 


Exercise 4.9 (MA(2) process) 
Let (Xi)iez be a time series of the form 


X, = et + 14-14 0261.2, t€Z, 


where (€4)rez ~ WN(0, 02). 
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a) Show from first principles (that is without using general formulas provided in MFE (2015)) 
that (X;)iez is causal and stationary and compute the mean function and autocorrelation 
function. 


b) Assume that (X;),ez is an invertible process. Derive formulas for the predictions P; X,,j, 
for h = 1 and h = 2 and explain what happens when h > 3. 


Exercise 4.10 (Absolute summability condition) 
Let X, = Y $5. ss Vx&i- x for (Et)iez ~ WN(0, 02). Show that Y72-.. [Wk] < oo implies that 
E(* E [VkEt-k|) < œ or, in other words, $77. 4; V'x&i- x converges absolutely, almost surely. 


Exercise 4.11 (AR(1) process) 

This question is intended to fill in some of the details and extend the analysis of the AR(1) 
model in MFE (2015, Example 4.11). Let the process (.X;)iez be a stationary solution of the 
AR(1) equations 


X_— 1 Xt-1 =, t EZ, |di| <1, (E1) 


where (£i)iez ~ WN(0, 02). 

a) Show that if (Y:)zez is another stationary solution of (E1), then the process (Z;)ez defined 
by Z = X, — Y; is almost surely equal to zero for all t. 
Hint. Derive the recurrence relation Z; = ¢7Z:-n, n € N, and use the Cauchy-Schwarz 
inequality E(|Z;—n|) € 4/IE(Z2 ,,) to show that E(|Z;|) = 0 for all t € Z. 


b) Let Y; = 72-0 oet, t € Z. Show that this series converges absolutely almost surely. 
Hint. Apply Exercise 4.10. 


c) Show that (Y;)zez satisfies (E1) and hence conclude from a) that X, = Y;, almost surely, 
for t € Z. 


d) Compute the mean function and autocorrelation function of ( X4);ez. 


e) Derive a prediction formula for P; X4; for all h € N. 


Exercise 4.12 (ARMA(1,1) process) 
This question is intended to fill in some of the details of the analysis of the ARMA(1, 1) model 
in MFE (2015, Example 4.12). Let (X;)iez be a stationary and causal solution of 


Xi — Oh Xt-1 = Ee 0:61, t€Z, (E1) 


for some constants $4 Æ 0, 61 40 and (ei)iez ~ WN(0, 02), 0 < ox < oc. 


a) Assume that $1 Z —0;. By using the general ARMA theory in MFE (2015, p. 103) show 
that a necessary and sufficient condition for the equations (E1) to have a stationary and 
causal solution is |ó1| < 1. 


b) By using MFE (2015, Proposition 4.9), compute the autocorrelation function of ( X;)iez in 
this case. 


c) What is the solution of the equations (E1) when $1 = —01? 
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Hint. One can proceed as in the proof of Exercise 4.11 a). 


Exercise 4.13 (Kurtosis of GARCH(1,1) process) 
Let (X;)iez be a covariance-stationary GARCH(1,1) process with finite kurtosis kz of the 
innovation distribution. 


a) By using the solution of the GARCH(1,1) defining equations (see MFE (2015, equa- 
tion (4.27)) show that (.X;);ez has a finite fourth moment if and only if 


(a1 + 61)? <1- (Kz — lal, 


where o; and £1 are the ARCH and GARCH parameters, respectively. 


b) Suppose you know that the kurtosis kx of the GARCH(1, 1) process is equal to the kurtosis 
of the innovation distribution. Determine kx in this case and comment on whether this 
situation is likely to arise when modelling financial return data. 


Exercise 4.14 (GARCH(2,1) and GARCH(1,2) processes) 
Let (X;)iez be a covariance-stationary GARCH(2, 1) process. 


a) Derive a recursive formula for P,X?,, = E(X2,, | Fe) for h > 1. 


b) Compute the limit limp_,. PRXO and infer the condition for covariance stationarity from 
this limit. 
c) Repeat a) and b) for the GARCH(1, 2) process. 


Exercise 4.15 (Skewed innovation distributions) 

'This question concerns the method that is used in the R package rugarch to introduce skewness 
into the innovation distribution of a GARCH process. Let fx denote a unimodal density that 
is symmetric around 0 and consider the density fy given by 


2 


= 74 pai fx (6) H(—2) + fx(6-12)H(z)), «ER, 


fy (a) 


where ô > 0 is a skewness parameter and H(z) is the Heaviside function defined by H(x) = 0 
for x < 0, H(0) = 0.5 and H(z) = 1 for x > 0. 


a) Explain the intuition behind this construction. 
b) Verify that fy is a probability density and calculate the distribution function Fy of fy. 


c) Let X be a random variable with density fx and let Y be a random variable with density 
fy. Calculate the kth moment of Y in terms of the kth moment of X, assuming that the 
latter exists. 

d) The distribution of Y must be standardized to have mean zero and unit variance for use as 
the innovation distribution of a GARCH process. Calculate E(Y) and var(Y) in the case 
where X ~ N(0, 1). 


e) Repeat the calculations in the case where X ~ t, v > 1. 
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Exercise 4.16 (Fitting GARCH models to equity index return data) 
Take the set of closing prices of the S&P 500 index from 2006 to 2009 including the financial 
crisis; see the dataset SP500 of the R package qrmdata. 


a) Fit GARCH(1,1) models with standard normal, standardized t and standardized skewed t 
innovations to the negative log-returns; see the functions ugarchspec() and ugarchfit () 
in the R package rugarch. Plot the resulting standardized residuals and Q-Q plots against 
the fitted innovation distributions. 


b) Add AR(1), MA(1) and ARMA(1,1) models for the conditional mean and investigate 
whether this improves the resulting fit of the models. 


c) Use your favourite model to calculate an in-sample estimate of volatility and an in-sample 
estimate of the 99% value-at-risk for the period. 


d) Compare the GARCH volatility estimate with an exponentially-weighted moving-average 
(EWMA) estimate of volatility. Note that this can be carried out in rugarch by choosing 
an IGARCH model in ugarchspec() and setting the parameters to have fixed values. 


Exercise 4.17 (Fitting GARCH models to weekly equity index return data) 
Repeat the analysis of Exercise 4.16 for the weekly returns of the S&P 500 index from 2000 to 
2015. 


Exercise 4.18 (Fitting GARCH models to foreign-exchange return data) 
Repeat the analysis of Exercise 4.16 for daily log-returns of the EUR/USD exchange rate from 
2012 to 2015; see the dataset EUR_USD of the R package qrmdata. 


Advanced 


Exercise 4.19 (AR(2) process) 
Let (Xi;)iez be a stationary and causal solution of the AR(2) equations 


Xt — O1Xt-1 — G2Xi-2 = 6€, t€Z, 


where $1 € R, ¢2 £0 and (€)tez ~ WN(0, 02). 
a) Using the general ARMA theory in MFE (2015, p. 103), show that the AR. parameters 
must satisfy the following conditions: 
i) |2| < 1; 
ii) $9 + $1 < 1; and 
iii) $9 — $1 « 1. 
Note. One can also show that i)-iii) are sufficient. 
b) What shape is the set of permissible parameters (61, $2) satisfying i)-iii)? 


c) Derive a recurrence relation for the autocorrelation function of ( X;);ez of the form p(h) = 
a,p(h — 1) + asp(h — 2) for some ay, as. What is p(0), p(1)? 
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Exercise 4.20 (EGARCH(1,1) model) 

The exponential GARCH (EGARCH) model is a widely-used alternative to the standard 
GARCH model. The EGARCH(1,1) takes the form X; = o;Z; where (Zi)iez ~ SWN(O, 1) 
and the logarithm of the volatility c; is updated according to the equations 


log(o7) = ao + e1Zia + (|Zea| — E(IZeal)) + &ilog(oz4), t€Z, 


where ao, 04, 91,71 € R. 

a) Give an economic interpretation for the form of the squared volatility equations. 

b) Let & = oe3Z4-1 t *Y(|Zi-i| — E(|Z-1])), t € Z. Is the process (€t)tez a strict white noise? 
Calculate the mean and variance of £+, t € Z, under the assumption that Z is symmetrically 
distributed around 0. 

c) For t € Z, let Y; = log(o2) and let & be defined as in b). What kind of process does (Y;);cz 
follow? 


d) Give a condition for (Y;)4ez to be covariance stationary and derive the mean and variance 
of Yi, t € Z. 


Exercise 4.21 (GJR-GARCH(1,1) model) 

The GJR-GARCH model of Glosten et al. (1993) is another extension of the standard GARCH 
model. The GJR-GARCH(1, 1) model takes the form X; = eZ; where (Zi)iez ~ SWN(O, 1) 
and the volatility e; is updated according to the equations 


c2 = ap + (o3 + yix, ae) X24 + Bio, te Z, 


where ag > 0,a1 +71 > 0 and 6; > 0. 
a) Give an economic interpretation for the form of the squared volatility equations. 


b) By considering the variance of the process var(.X;), t € Z, show that a necessary condition 
for (Xi)tez to be covariance stationary is that 


aı + o nes <1, 


where 0z = E(Itz,<0}2Z?). 
c) Show that 07 = 1/2 when Z; has a distribution that is symmetric about 0. 


d) Derive an equation for predicting the squared volatility o? jj h = 1, given the values of X; 


and o7. 
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Exercise 5.1 (Maximum domain of attraction) 
a) What is the name of the theorem that describes the limiting behaviour of sample maxima? 


b) Explain what it means to say that “a distribution function F belongs to the maximum 
domain of attraction of Hz for some € € R”. 


c) Why is the knowledge that F € MDA(He) for some € € R so useful for modelling the 
distribution of maximal losses from F? 


Exercise 5.2 (Characterization of the MDA of a GEV for positive shape) 
Characterize the maximum domain of attraction of He for € > 0. In other words, state precisely 
which distribution functions F belong to MDA(He), € > 0? 


Exercise 5.3 (Block maxima method) 


a) Explain the block maxima method for estimating the parameters of a generalized extreme 
value distribution. 


b) Explain why the block maxima method involves a bias-variance trade-off. 


Exercise 5.4 (Excess distribution and mean excess function) 


a) For a random variable X ~ F, how are the excess distribution function over a threshold u 
and the mean excess function defined? 

b) What is the excess distribution function of the (memoryless) exponential distribution 
X ~ Exp(A), A» 0? 

c) What is the mean excess function of the exponential distribution? 

Exercise 5.5 (VaR, ES and the mean excess function) 


Let X ~ F be continuously distributed with E(|X|) < oo. Give a formula relating the expected 
shortfall (ES) of X to the value-at-risk (VaR) of X via the mean excess function. 


Exercise 5.6 (Peaks-over-threshold method) 
a) What is the name of the theorem that underlies the peaks-over-threshold (POT) method? 


b) Explain the POT method for estimating the distribution of excess losses above a threshold. 


c) Explain why the POT method involves a bias-variance trade-off. 


d) Does the POT method represent a more or a less efficient use of the data on extreme events 
than the block maxima method? 
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e) Explain the key ideas in the estimation of tail probabilities based on the POT method. 


Exercise 5.7 (Threshold choice in the peaks-over-threshold method) 
Figure E.5.1 shows sample mean excess plots for two sets of financial losses. 
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Figure E.5.1 Sample mean excess plots for two sets of financial losses. 


a) For each plot in Figure E.5.1 suggest a threshold u which could be used when applying the 
POT method. Justify your choice. 


b) Can you use the plots to give rough estimates of the shape parameter € of an approximate 
GPD model for the excesses over u for the two datasets? 


Exercise 5.8 (The tail index and its estimation) 
a) Define the tail index of a distribution F € MDA(H¢) for € > 0. 


b) Describe two possible approaches to estimating the tail index for a suitable dataset. 
Exercise 5.9 (The extremal index) 

Explain what is meant by the extremal index of a strictly stationary process (X;)iez.- 
Basic 


Exercise 5.10 (Max-stability of the Fréchet distribution) 
A random variable X is maz-stable if for any n € N the maximum Mn = max(Xi,..., Xn} of 
independent copies X4,..., Xn of X satisfies 


(M, — dn) /en = X 
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for real constants c, > 0 and dn. By determining appropriate sequences (cn) and (dn), prove 
that the Fréchet distribution with distribution function F(x) = exp(—2~°), x > 0, 0 > 0, is 
max-stable. 


Exercise 5.11 (A continuous distribution for which maxima do not converge to a GEV) 
Let X be log-Pareto distributed with distribution function Fx, that is X = exp(Y) for 
Y ~ Pa(a, x); see MFE (2015, Appendix A.2.8). Show that Fx € MDA(H;) for any € € R. 


Exercise 5.12 (Double logarithmic tail plot) 


a) Suppose you have data x1,...,2, which are believed to represent realizations of iid random 
variables with distribution function F € MDA(H¢) for € > 0. A popular diagnostic plot for 


such data is (x;, F(z;)), i € {1,...,n}, using logarithmic x and y-axes. Explain why. 


b) Explain how the plot can be used to construct an estimator of the tail index. 


Exercise 5.13 (Uniqueness of limit up to location and scale) 

In MFE (2015, Example 5.6) it is shown that if we take the Pareto distribution with distribution 
function F(x) = 1 — (k/(k4- x))?, z > 0, a,« > 0 and the normalizing sequences c, = kn!/%/a 
and dn = «(n!/* — 1) for n € N, then 


]im F (Cnt + dn) = Hijo (x) . 


Suppose we consider the alternative normalizing sequences c, = F* (1 — 1/n) and dn = 0 for 
n € N. Show that F" (cnx + dn) converges to H/4,,,, (x) as n — oo for some u € R and o > 0 
which you should determine. 


Exercise 5.14 (GPD with negative shape parameter) 
Let X follow a generalized Pareto distribution with shape € < 0 and scale 8 > 0. Derive the 
distribution of Y = —£X/8. 


Exercise 5.15 (Slow variation, regular variation) 

a) Show that the following functions are slowly varying at oc. 
i) L(x) = 2+ cos(1/x), x » 0. 
n) L(x) = log a; x > 0. 

iii) L(x) = 1/log z, z > 1. 

iv) L(x) = L(1/z) where L : [0,00) — [0,00) is slowly varying at 0. 


b) Show that the following functions are regularly varying at co and determine the correspond- 
ing index. 
i) h(z)—z 9,2 20,00. 
ii) h(x) = (1+ z)9, x » —1, 0 » 0. 
iii) h(x) = A(1/x), x > 0, where h is regularly varying at 0 with index a € R. 


Exercise 5.16 (GEV limit for maxima from the GPD) 
Show that Geg € MDA(He), EER. 
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Exercise 5.17 (GEV limit for maxima from the Burr distribution) 
Show that the Burr distribution with distribution function 


F(z) 21—(k/(&-Fa7), 20, 5,06,A» 0, 


is in the maximum domain of attraction of Hz for some € which you should determine in terms 
of the parameters of the distribution. 


Exercise 5.18 (The Weibull distribution as a special case of the GEV) 
For € « 0 the generalized extreme value (GEV) distribution takes the form 


Heg(x) = exp(—(1 + £r) 5) forall x <—-1/Eé. 


This case is referred to as a Weibull distribution although it differs from the standard Weibull 
distribution used in statistics and actuarial science. Let X ~ Hg and Y =1+€X. 


a) Derive the distribution function of Y and its domain. 
b) Verify that Y has a standard Weibull distribution with density 


f(y) = ev! lexp(-ey?), y» 0, 
for parameters c > 0 and y > 0 which you should determine in terms of £. 


Exercise 5.19 (An alternative formula for the mean excess function) 


a) Let X ~ F with E(| X|) < oo. Show that the mean excess function can be calculated using 
the formula 


eu) s F,,(x) dx = Fol F(x)dz, ue€[Ep,rr), 


where zp = inf{a € R: F(x) > 0} denotes the left endpoint of F. 
Hint. First show that xF(«) — 0 for x > xp. Conclude that Ie ou xFa(£) = 0. Then 
use the Stieltjes form of the integration by parts formula 

b 


[ reas = hoo]! - f soa: 


a 


see Apostol (1974, p. 144) or Arendt et al. (2002, p. 52). 


b) Use the formula in a) to calculate the mean excess function of a random variable X following 
a generalized Pareto distribution with shape £ < 1 and scale 6 > 0. 


Exercise 5.20 (Scaling of VaR, ES under a power tail) 
Let X ~ F where F(x) =1—c(k+2)~°, x > u, for some c > 0, x € R, 0 > 0 and u > 0. 
a) Show that, for all o; and ag satisfying F(u) < o1 € ag < 1, it holds that 


VaRa,(X) = ( KIR Oak 
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b) Let 0 > 1. Show that, for all a; and ag satisfying F(u) < o4 € o» < 1, it holds that 


- ast 
ESQ) = 2-5 J («+ VaRa, CX)) — K. 


c) Explain why these results are useful from a practical point of view. 


Exercise 5.21 (Block maxima method applied to S&P 500 return data) 
Imagine you are a risk manager sitting in your office in the evening of Friday 26 September 
2008. It has been a turbulent period in the financial market and the S&P500 index has dropped 
by 3.33% over the course of the week and by 11.82% over the past quarter. 

Use statistical software to carry out a block maxima analysis of daily negative log-returns 
of the S&P500 index since 1960 suing a block size of one year; see the dataset SP500 of the 
R package qrmdata. In other words, fit a GEV distribution Hg to the block maxima where 
0 = (£, 1,0). 

Answer the following questions: 

a) Compute the maximum likelihood estimator of the parameter vector 0. Based on the 
estimated shape parameter, which moments of the fitted GEV are finite? 


b) What is the probability that next year’s maximum loss exceeds the maximal loss over the 
past 20 years? 


c) What are (approximately) the 10-year and 50-year 260-block return levels r260,10 and 260,50, 
respectively? 


Note. The index 260 stands for the approximate number of trading days per year here. 


d) What is the return period k260,0.0922 of a loss at least as large as 9.22%, as seen on Monday, 
29 September 2008? 


Exercise 5.22 (Peaks-over-threshold analysis of Bitcoin data) 
The objective of this exercise is to use the POT method to analyse extreme negative log-returns 
on the dollar price of one Bitcoin from 1 January 2014 to 31 December 2017; see the dataset 
crypto of the R package qrmdata and the R package qrmtools for functions for implementing 
the POT method. 

Work through the following steps: 


a) Load the Bitcoin price data and plot it. Also compute the series of negative log-returns 
and plot these. 


b) For the negative log-return data, use a mean excess plot to find a suitable threshold u. 


Check your choice with a plot of fitted GPD shape parameters against various thresholds. 


c) Compute the maximum likelihood estimator of the GPD parameters based on the excess 
losses over your chosen threshold u. Use a Q-Q plot to verify whether the fitted GPD fits 
the excess losses well. Also provide a plot of the empirical distribution function of the 
excess losses overlaid with the fitted GPD to see whether the two match. 

d) Use a POT-based tail estimator to compute VaRo.99 and ESo.975 and compare the two 
values. 
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e) Plot the empirical tail probabilities F,,(x), where F, denotes the empirical loss distribution 
function, at all exceedances x of the threshold u and overlay the points with the tail 
estimator obtained from the POT method; see MFE (2015, equation (5.21)). Also indicate 
VaRo.99 and ESo.975 in the plot. 
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Exercise 5.23 (Alternative condition for convergence of normalized maxima) 
a) Show that 


PU Cer + dn) x H(x) if and only if nF(c,z + dn) dk Tog H(z). 


Hint. You may use that an — a € R for n — oo if and only if (1 + an/n)” — exp(a) for 
n —> oo. 


b) Use part a) to show that the exponential distribution function F(x) = 1 — exp(—Ax), 
x > 0, A> 0, belongs to the maximum domain of attraction of the Gumbel distribution 
Ho(x) = exp(—e *), x € R, for the normalizing sequences c, = 1/A and d, = log(n)/A, 
n € N. 


Exercise 5.24 (Maximum domain of attraction of a log-gamma distribution) 
In MFE (2015, Example 16.1) Karamata’s Theorem (MFE (2015, Theorem A.7)) is used to 
show that the Student t distribution is in the maximum domain of attraction of the Fréchet 
distribution. Use a similar method to analyse the behaviour of normalized maxima for the 
log-gamma distribution LGa(o, 8) with distribution function F and density 

Q 


f(z) = H3 (logz)?71z-0-1, z»1,«»0, 20. 


In other words, determine the € for which F € MDA(H;). 


Exercise 5.25 (Maximum domain of attraction under serial dependence) 
Suppose (Xn)nen is a sequence of iid random variables with distribution function F(x) = 
vV1-—z-?,r 21,0 0. Furthermore, let 


Yn = max (X»,-1, X2n}, n € RN, 
Zn = max{ Xn, Xni}, ncN. 


a) Clearly, (Yn)nen is a sequence of iid random variables. Compute the distribution function 
of Yj. 


b) Let MY = max{Yj,...,Y¥n}, n € N. Find real sequences (cn) and (dp), with cn > 0 for all 
n, and a non-degenerate distribution function HY such that (MY — d,)/c, converges in 
distribution to HY. 
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c) Compute the distribution function of Zi. 
d) Determine whether (Zn)nen is also a sequence of independent random variables. 


e) Let MZ = max{Z,..., Zn}, n € N. Using the same sequences (cn) and (dn) as in b) show 
that (MZ — dn)/cn converges in distribution to a limit H^. 


f) How are the limiting distributions HY and H^ related and what is the connection with the 
extremal index? 


Exercise 5.26 (A link between the Poisson, GPD and GEV distributions) 

Suppose that over some fixed period we observe a Poisson-distributed number of losses N ~ 
Poi(A) for some A > 0. Let the loss severities be described by an iid sequence of random 
variables (Y;)nen from a GPD with shape € € R and scale 8 > 0. Assume that the loss 
severities (Y;)ncn are independent of the loss frequency N. Let 


My =max{Yi,.2:,Yn}: 


Show that My ~ He uo where 


m 
A(R get aen 


Exercise 5.27 (Convergence of the excess distribution for a gamma distribution) 

Let X ~ Ga(2,1) be gamma distributed. In solving this exercise it will be seen that the 
Pickands-Balkema-de Haan Theorem is particularly easy to verify in this case. 

a) Compute the excess distribution function F, over a threshold u > 0. 

b) Show that 


lim su F(z) — Go4(x)| = 0. 
Jim, sup |i) — Goa) 


c) Using b), identify the maximum domain of attraction to which Ga(2, 1) belongs. 


d) Compute e(u) and explain why this implies the same conclusion as in c). 


Exercise 5.28 (Shortfall-to-range-shortfall ratio for a GPD tail) 

Let L have distribution function F € MDA(H;z) and assume that for some threshold u € [T p, £F) 
the excess distribution exactly satisfies F,(x) = Ge g(x), r € [0,ar — u), € € R, where 
Tr =inf{x € R: F(x) > 0} denotes the left endpoint of F. 


a) For 0 < o4 < a2 < 1, compute the range expected shortfall 


1 az 
VaR;(L) dz. 


ES, a, (L) zz amm 
ay 


b) For £ < 1, derive ESS(L). 
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c) For £< 1, show that for all y > 1, 


and interpret this result from a risk management perspective. 


Exercise 5.29 (Probability-weighted moments for the GEV distribution) 
If they exist, the probability-weighted moments of a random variable X with distribution 
function F are defined by 


Mp,r,s = E(X?F(X)"(1— F(X)?), prs ER. 


For continuous F and U ~ U(0,1), this implies that My, = E(F* (U)PU"(1— U)*). Consider 
the probability-weighted moments given by 8, = Miı,r,o and assume that F = H¢,,,, for some 
£«]1. 


a) Compute B, for r > —1. 


Hint. You may use the fact that [5* log(z) exp(—x) dx = —7, where y denotes the Euler- 
Mascheroni constant. 


b) Hence compute fo, 281 — fo and (385 — 8o) / (281 — Bo). 
c) Based on b) and estimates f, of 6,, design an algorithm for computing estimates of £, u, 0. 


Exercise 5.30 (Probability-weighted moments for the GPD) 

In this exercise we apply the method of probability-weighted moments as introduced in 
Exercise 5.29 to the estimation of a generalized Pareto distribution F = Gg under the 
assumption that £ « 1. For the GPD it proves easier to consider the probability-weighted 
moments given by as = M1 ,s. 

a) Compute o; for s > 0. 

b) Hence compute 2 — ag/(ag — 201) and 2a901/(a — 201). 


c) Based on b) and estimates &s n of as, design an algorithm for computing estimates of £, B. 
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Exercise 6.1 (Sums of normal random variables) 
Is it true to state that if L1 ~ N(u1, 0?) and Lo ~ N(u2, 03) then Ly + Lo ~ N( u1 + u2, o? 4-02)? 
Explain your answer. 


Exercise 6.2 (Independence of uncorrelated normal variables) 
Let X ~ Ng(p, X). Explain why we can infer that if X1,..., Xa are uncorrelated they are also 
independent. 


Exercise 6.3 (Stochastic representation for normal variance mixtures) 

Let X ~ Malu, X, A) be a random vector with a normal variance mixture distribution. Let W 
be a random variable with Laplace-Stieltjes transform H. Assume that X is positive definite 
with Cholesky decomposition X = AA’. 


a) Give a stochastic representation for X which shows how X may be generated starting from 


W and a vector of independent standard normal random variables Z = (Z1,..., Za)’. 

b) Assuming X;,..., X4 have finite variances, use this representation to explain why uncorre- 
latedness of X4,..., X4 does not imply their independence, unless W is constant almost 
surely. 


Exercise 6.4 (Normal variance mixtures as elliptical distributions) 
Using the stochastic representation in Exercise 6.3, show that X ~ Ma(p, X, H ) has an elliptical 
distribution. 


Exercise 6.5 (Distinguishing distributions by scatterplots of random samples) 
Figure E.6.1 shows random samples from nine bivariate distributions. Match the pictures to 
the following distributions: 


a) a normal distribution; 

b) a £ distribution with v = 3.5 degrees of freedom and positive correlation; 
c) at distribution with v = 3.5 degrees of freedom and negative correlation; 
d) a f£ distribution with v = 3.5 degrees of freedom and zero correlation; 

e) an elliptical distribution with discrete radial part; 

f) an elliptical distribution with bounded radial part; 


g) a mixture of a normal and a singular distribution; 
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h) the distribution of (X1, X2(1 — 21x, <o})) where (X1, X5) is jointly normal; and 


i) a normal mean-variance mixture. 


Exercise 6.6 (Advantages of generalized hyperbolic distributions) 
Describe some of the properties and features of generalized hyperbolic distributions which make 
them appealing for applications in financial risk management. 


Exercise 6.7 (Different kinds of factor models) 
Explain the key differences between macroeconomic, fundamental and statistical factor models. 


Basic 


Exercise 6.8 (Independence of uncorrelated Bernoulli variables) 

Let (X1, X2) follow a bivariate Bernoulli distribution with given joint probabilities P(X, = 
k, Xə = l) = py for k,l € {0,1}. Show that X4 and X» are uncorrelated if and only if they 
are independent. 


Exercise 6.9 (Calculating the correlation coefficient from a joint density) 
Consider a bivariate distribution function F with density 


f(z1,22) = 0(0 + 1)(xi 22 — 1) 0*2, 2,25 € [1,00), 6 > 0. 


a) Derive F. 
b) Derive the marginal distribution functions Fi, F> of F. 


c) Compute the correlation coefficient p corresponding to F. What assumption must we make 
about 0 in order that p exists? 


Exercise 6.10 (Cholesky decomposition) 
Let X be the positive-definite matrix 


Tm o? P0102 
poio 08 
for c1 > 0, og > 0 and p € (-1,1). Show that X has a unique Cholesky factor A. In other 


words, show that there exists a unique lower-triangular matrix A with positive diagonal entries 
satisfying X — AA'. 


Exercise 6.11 (Density of multivariate normal) 
Let Z = (Zi,..., Z4) where Zi,..., Z4 are iid N(0,1) random variables. Let X = w+ AZ 
where u € R? and A € R?*? with rank A = d. Clearly X ~ Na(u, X) where X = AA’. 


a) Write down the joint density fz(z) of Z. 
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b) Hence, by considering the transformation T(z) = + Az and applying the method of 
density transformation, show that the density of X is given by 


1 


(27) 4/2./det X i 


Exercise 6.12 (Conditional distribution of bivariate normal) 
Let (X1, X3) ~ No(s, ©) where p = (u1, u2) and assume that var(.X;) = cj > 0 for j € {1,2} 
and p( X4, X2) = p € (—1,1). By using the density derived in Exercise 6.11, show that 


fx(@) = 


xp( yc uE (s - 1)). x c RA. 


Oo 
X3| Xi mi N (us + p= (01 - m), o3(1 — p)). 


Exercise 6.13 (Covariance matrix of a mixture distribution) 

Let X = f(W,Z) be a random vector whose distribution depends on a random vector Z 
and an independent, non-negative scalar variable W, such as in the case of a normal variance 
mixture or a normal mean-variance mixture distribution. Assume the distributions of Z and 
W are chosen in such a way that the covariance matrix cov( X) exists. 


a) Derive the formula cov(.X) = E(cov(.X | W)) + cov(E(X | W)) in this setup. 
b) Calculate cov(.X) when cov( X |W) = WX and E(X |W) = w+ W^ for a covariance 
matrix 3 and parameter vectors ys and y. 


Exercise 6.14 (Normal variance mixture with discrete mixing variable) 
Let W be discrete with P(W = 1) = p and P(W = 4) = 1 — p for some p € (0,1). Furthermore, 
let Z and Z be iid N(0, 1) variables, which are also independent of W. For a € R, let 


Xi =v W(Zi + 2Z2) and Xə=a+vW Zo. 


a) Show that X = (X4, X2) has a normal variance mixture distribution. 
b) Compute the covariance matrix of X for p — 2/3. 
c) For p € (0,1), find the distribution function of X, + X». 


Exercise 6.15 (Normal variance mixture with Pareto Type | mixing variable) 
Let W ^ Pa(0), 0 > 1, with distribution function F(x) = 1 — x^?, x > 1, and let Z, and Z3 be 
iid N(0, 1) variables, which are also independent of W. Furthermore, let 


Xi, = WA + Z2) and X= VW = Zə). 


a) Show that X = (X4, X2) has a normal variance mixture distribution. 


b) Compute the covariance matrix of X and show that X1, X» are uncorrelated. 


Exercise 6.16 (Diversification and the single-factor model) 
Suppose that financial losses X1,..., X4 on d different assets are described by the single-factor 


model 
Xj = pZo+ J1— p?Zj, Pe lyse ts 


where p € (0,1) and Zo, Z1,...,Zq are iid N(0, g?) random variables. 
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a) Calculate the covariance matrix of X = (X1,..., Xj). 

b) Determine the joint distribution of X. 

c) Calculate VaR4(X4) and VaRa(Xa) for Xa = 1354 Xj and a € (0, 1). 

d) Does investing a capital of 1/d in each of X1,..., X4 decrease VaR, compared to investing 


the whole capital 1 in X,? Can this diversification effect be significantly improved by 
increasing d if d is already large? Explain your answers. 


Exercise 6.17 (VaR of the sum of jointly normal variables) 
a) Let (X1, X2) be normally distributed with correlation p € [71,1]. Compute VaRq(X1+ X2). 
b) Let X = (X1,..., Xa) ~ Na(p, £). Compute VaRo(X1 +--+ X4). 


Exercise 6.18 (Contrasting models for uncorrelated normal variables) 

Let (X1, X2) ~ N2(0, 2) and (Yi, Y) = (X,,V.X1) where V is independent of X, with 
PY = —1) = P(V = 1) = 1/2. 

a) Show that (X1, X2) and (Y1, Y2) have the same margins. 

b) Show that (X1, X2) and (Y1, Y2) are both uncorrelated pairs. 

c) Compute the distribution functions of X, + X? and Yı + Yo. 


d) Compute VaRa(Xı + X2) and VaR4 (Yi + Y2) for a = 0.9 and a = 0.99 and interpret the 
results. 


Exercise 6.19 (VaR-minimizing portfolio for a normal variance mixture) 

Consider the normal variance mixture X = /WY for Y ~ N4(0, £), where W > 0 is a random 

variable independent of Y and X is a positive definite covariance matrix. 

a) Determine the distribution of a/Y for a € R^ (0) and give a stochastic representation 
for a/Y in terms of a standard normal variable Z ~ N(0, 1); recall that this stochastic 
representation is a representation of the form a/Y = f (Z), for some function f, which 
shows how the distribution of a/Y may be stochastically simulated using Z. 

b) Using a) show that, for any a,b € R^M0], we can write a/X = Ca 5b X and derive a 
formula for the constant cap € R. 

c) Using b) derive a formula for VaR,(a’X) in terms of VaRa4( Xi). 


d) Suppose X — (ss ro and a = (a1,1 — a1). For a € (1/2,1), determine the value of a4 


that minimizes VaR4(a/ X) and give the corresponding value of VaR4(a/ X). 


Exercise 6.20 (Linear combinations of t random variables) 

a) Let X = (Xi,..., Xa) ~ ta(v, p, X). Use characteristic functions to show that the distribu- 
tion of the linear combination Ag + AX = Ao + E Aj X; is univariate t and identify the 
corresponding parameters. 

b) Now let X,..., Xa be iid random variables with a standard univariate ¢ distribution with 
v degrees of freedom, that is tı(v, 0,1) in notation of part a). Calculate the characteristic 
function of Ag + mra AjX; in this case and explain why this is not a special case of a). 
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Exercise 6.21 (Linear combinations of elliptical random vectors) 

Let X ~ Eal, ©, Y), B € R**? and b € R^. Use characteristic functions to show that 
Y = BX +b ~ Ex(Bp + b, BB’, v), that is show that linear combinations of elliptically 
distributed random vectors are elliptically distributed with the same type of distribution 
determined by wv. 


Exercise 6.22 (Convolutions of elliptical distributions) 
Let X ~ Eg(p, X, v) and Y ~ Eal ñ, cX, ij) for some c > 0 be independent. Use characteristic 
functions to show that 


X +Y ~ E(u + À, Y, v) 
and derive the form of the characteristic generator v. 


Exercise 6.23 (Fitting generalized hyperbolic distributions to equity return data) 

Take the daily log-return data from 2005 to 2012 for the following 10 components of the Dow 

Jones index: Apple, Cisco, Disney, IBM, Intel, McDonald's, Microsoft, Nike, Proctor & Gamble, 

and Walmart. Carry out the following analyses: 

a) Fit univariate t, NIG, hyperbolic, GH and VG distributions (both symmetric and asym- 
metric) to the daily log-returns for each stock. In other words, repeat the analysis of MFE 
(2015, Example 6.14) for these data. Summarize the results. 

b) Fit multivariate t, NIG, hyperbolic, GH and VG distributions (both symmetric and asym- 
metric) to the 10-dimensional data. In other words, repeat the analysis of MFE (2015, 
Example 6.15) for these data. Summarize the results. 

c) Assume you have invested an equal amount of money in each of these stocks. For the 
multivariate model that you find best, plot the implied distribution of daily portfolio returns. 

The R package ghyp contains code to estimate all of these distributions. For the data see the 

dataset DJ_const of the R package qrmdata. 


Exercise 6.24 (Fitting a one-factor model to equity return data) 

Take the daily return data used in Exercise 6.23 and the series of daily log-returns for the Dow 

Jones 30 index for the same period. 

a) Fit a single-index model and use it to estimate the betas of the 10 selected Dow Jones stocks. 
In other words, carry out an analysis using similar steps to MFE (2015, Example 6.34). 

b) Comment on the quality of the resulting factor model. 


See the dataset DJ of the R package qrmdata for daily values of the Dow Jones 30 index. 


Exercise 6.25 (Principal components analysis of equity return data) 

Take the daily return data used in Exercise 6.23. 

a) Carry out a principal components analysis and determine the proportion of the total 
variability of the data that is accounted for by the first two and the first three principal 
components. In other words, carry out an analysis using similar steps to MFE (2015, 
Example 6.36). 
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b) Determine the factor loadings for the first two principal components and comment on 
whether they have clear interpretations. 


c) Construct time series of the first two principal components and plot them. 


Advanced 


Exercise 6.26 (Equicorrelation matrix) 

Let X = (X1,..., Xa) with oj = var(X;) € (0,00), for j = 1,...,d, and a correlation matrix 
P which is an equicorrelation matrix with off-diagonal elements equal to p. 

a) Show that P is positive semi-definite. 

b) Show that p > —1/(d — 1). 


Exercise 6.27 (On the invertibility of the sample covariance matrix) 


Let Su € RIX? be the unbiased sample covariance matrix computed from a random sample 
X31,..., Xn of dimension d. 


a) Show that S, can be written as 


1 


n— 1 


Sy = X'(In — Jn/n)X, 
where X = (X},...,X/)’ € R"*4 and J, = 11' € R?** for 1 = (1,...,1) € R”. 
b) Show that rank(Z, — Jn/n) — n — 1. 


c) Hence show that S,, has rank at most min{n — 1, d) and conclude that Su is not invertible 
if n <d. 


Hint. You may use that rank(AB) € min{rank A, rank B} and other identities for the rank of 
matrices. 


Exercise 6.28 (Characterization of the multivariate normal distribution) 


a) Show that two d-dimensional random vectors X,Y are equal in distribution if and only if 
a/ X 5 a/Y for all a € RÊ. 


b) Show that X ~ Na(p, X) if and only if a/X ~ N(a/p, a/Xa) for all a € R4. 
m 


Exercise 6.29 (Conditional distribution of multivariate normal) 
Let X ~ Na(p, X) with rank X = d. For some k € (1,...,d — 1), write X = (X1, X2), where 


X4, € R*, X5 € R-*. Furthermore, let u = (p^, ph) and Xi = x =) Show that 


Xo |X, = 2 ~ Na-k( H21, X221); 


where 131 = p2 + Ea Xj (21 — p1) and X231 = X23 — Yai hy X12. 
Hint. Consider Y = Xə + AX, for A= -5a E. 
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Exercise 6.30 (Density of multivariate t) 
Let X = u-- VW AZ for u € Ra, A € RÌ? with rank A = d and Z = (Zi,..., Za) ~ Nq(0, Ia) 
independently of W ~ Ig(v/2, v/2) for v > 0. In other words, X ~ ta(v, p, X), where X = AA’. 


a) Show that X has density 


; x c R4. 


I'((v + d)/2) ( | (z -uE (z - py em 
(v/2) (nv)? /det Y; 


b) By using Stirling’s Formula, that is r(x + 1) ~ V27a(a/e)* for x — oo, show that fx 
converges to the density of Na(p, X) as v > oo. 


fx(x) = 7 


Exercise 6.31 (Conditional distribution of bivariate t) 
c? 010 H 

Let (X1, X2)  to(v, u, X) with X = E en n 2 for 01,02 > 0 and p € (—1,1). By using the 
010 05 

density of a multivariate t distribution, see Exercise 6.30, show that 


v+ ((xı = aie 
v+1 ; 


(ox 
X| Xi mp ~ (v+ 1, pe tem — m), o3(1 — p?) 


Exercise 6.32 (An interpretation of the principal components transform) 

Let X be a random vector with E(X) = 0 and cov(X) = X. Let Y be the vector given 
by the principal components transform of X. In other words, Y = I’X where X = TAT”, 
the columns of T contain the orthonormal eigenvectors of © and A = diag(A1,..., Aq) where 
A1 > + > Aq = 0 denote the sorted eigenvalues of X. The jth principal component Y; is thus 
Y; = X , where y; is the jth column of T (the eigenvector corresponding to the jth largest 
eigenvalue of ©), also known as jth vector of loadings or jth principal axis. 


a) Show that the first principal component Y, of X satisfies 


var(Y;)= max  var(a/X), 
a€R4, |[a|| 1 


that is Yı is the standardized linear combination a/X of X with maximal variance among 
all linear combinations. 


b) Show that the second principal component Y> of X satisfies 
var(Ya) 2 max  var(a/X), 


a€R", |[a[-1, 
a/*1—0 


that is Y> is the standardized linear combination a/X of X with maximal variance among 
all linear combinations orthogonal to the first principal axis. 


Hint. Apply the method of Lagrange multipliers. It may also be useful to use the facts that 
ag = $-z'a— a and $x'Ax = (A+ Az. 
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Review 


Exercise 7.1 (The concept of a copula) 
Give a definition of a copula. 


Exercise 7.2 (The importance of Sklar's Theorem) 

State Sklar's Theorem, giving both the main statement and its converse. Explain the relevance 
of the result from the point of view of studying dependencies between random variables and 
constructing multivariate models for applications. 


Exercise 7.3 (On the importance of the three defining properties of copulas) 

Show that the function W(w) = max(?75., uj — d+ 1,0) fulfills the following properties but is 
not a copula for d > 3: 

i) Wí(u;,...,uq) =0 if uj = 0 for any j; 

m W(1,...,1,uj,1,...,1) = u; for all j € {1,...,d} and u; € [0,1]; and 


iii) W is increasing in each component. 


Exercise 7.4 (Special cases of normal variance mixture copulas) 

Consider a d-dimensional normal variance mixture copula (such as Gauss or t) with off-diagonal 
entries of the correlation matrix P all equal to p € [-1, 1]. What models do we obtain in the 
following special cases? 


a) p= 
b) p=0 
c) p=—-landd=2 
d) p= —1 and d > 3 


Exercise 7.5 (Correlation and rank correlation) 
Describe some drawbacks of Pearson’s linear correlation coefficient that are not shared by rank 
correlation coefficients (such as Spearman’s rho and Kendall’s tau). 


Exercise 7.6 (Maximal VaR of the sum) 

Consider the class of joint distributions of the pair (X1, X2) when the marginal distributions 
are fixed and the copula is allowed to vary. Assume that the marginal distributions are such 
that X4 and Xə have finite variances. For each of the following statements, state with reasons 
whether they are true or false. 
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a) p(.X1, X2) is maximal when (X1, X2) has the comonotonicity copula M. 
b) The variance of X, + X» is maximal when p( X1, X5) is maximal. 
c) For fixed a € (0,1), VaRa(X1 + X2) is maximal when o( X1, X2) is maximal. 


Exercise 7.7 (VaR for the sum of two normal risks) 

A risk manager makes the following assertion: “If X1 and X» are normally distributed risks 
with correlation p, then I know the value of VaR4(X4 + X2) for any a”. Discuss whether you 
agree. 


Exercise 7.8 (Distinguishing meta-distributions from scatterplots) 
All the samples shown in Figure E.7.1 come from multivariate distributions with standard 
normal margins and differing underlying copulas. Match the pictures to the following copulas. 


a) a Gauss copula; 

b) a t copula with v = 3.5 degrees of freedom and positive correlation parameter; 
c) a t copula with v = 3.5 degrees of freedom and negative correlation parameter; 
d) a t copula with v = 3.5 degrees of freedom and zero correlation parameter; 

e) a Gumbel copula; 

f) a Clayton copula; 

g) an elliptical copula with bounded radial part; 

h) a copula with singular component; 


i) the copula of a normal mean-variance mixture. 


Exercise 7.9 (Estimation of a copula) 
Explain how a copula can be estimated from multivariate data without making specific 
parametric assumptions about the marginal distributions of the data. 


Basic 


Exercise 7.10 (Extracting the copula from a joint distribution) 
Suppose that d financial losses have the joint distribution function 


Faye ew(-(X«*) E x € (0,00), 0 € [1, 00). 
rx 


a) Derive the marginal distributions of F and give their common name. 
b) Hence derive the copula C of F and give its common name. 


c) Assume that d — 2 and 0 — 2. Calculate the probability that both losses jointly take values 
above their 95th percentiles and show that this is almost exactly 12 times larger than the 
case where the losses are independent. 
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Figure E.7.1 1000 iid samples from 
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-4 


various bivariate distributions with N(0,1) margins. 
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Exercise 7.11 (Copula of a normal random variable and its absolute value) 
Derive the copula of Z ~ N(0, 1) and |Z]. 


Exercise 7.12 (Minimal correlation for standard exponential random variables) 

Let (X1, X2) be a pair of standard exponential random variables with the countermonotonicity 

copula and let U be a standard uniform random variable. 

a) Find a stochastic representation of the form (X4, X2) = (f(U), g(U)) for functions f and g 
which shows how (X1, X2) can be simulated. 


b) With the help of a), use software and numerical integration to calculate the minimal 
correlation for two standard exponential random variables. 


Exercise 7.13 (Maximal correlation for Pareto Type | random variables) 
Compute the maximal correlation for a pair of random variables (X4, X2) with Pareto Type I 
marginal distribution functions of the form F;(z) = 1 — 1/25, > 1, 0; > 2, j € (1,2). 


Exercise 7.14 (Copulas and correlation ordering) 
Let (X1, Y1) have copula Cı and (X5, Y2) have copula Co. Assume moreover that X, = X 


and Y; = Yo and that the marginal distributions of X; and Y; have finite second moments 


for j € {1,2}. Use Hóffding's Lemma to show that C1(u) < C»(u), u € [0, 1]?, implies that 
p(Xy, Yi) € p(Xa, Y2). 


Exercise 7.15 (Copula-based dependence measures under simple transformations) 

a) Let (X1, X3) be a random vector with continuous margins. Compute ps(— X4, X3) in terms 
of ps Xi, Xə). 

b) Let (U1, U2) ~ C for some copula C with upper tail dependence coefficient Au. What is the 
upper tail dependence coefficient of the copula of (U1, U2)? 


Exercise 7.16 (Archimedean generator properties) 
Let v : [0, o0) — [0,1] be a generator of a d-dimensional Archimedean copula C. 


a) Show that lim, ,55 Y(t) = 0 must hold. 

b) Show that (0) = 1 must hold. 

c) Show that v(ct), t > 0, generates C for all c > 0. 

d) Show that the lower tail dependence coefficient of the bivariate copula satisfies Àj — 
A v(2t) 
limo MON 

Exercise 7.17 (Clayton copulas) 


a) Show that the function Y(t) = (1 + t)-V/9, t > 0, for 0 > 0 generates an Archimedean 
copula in all dimensions d > 2 and that the resulting copula is the Clayton copula 


—1/0 
Colu) = ( uz’ -d+ 1) , uc [0,1], 0 » 0. 


d 
J= 


1 
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b) Compute the lower tail dependence coefficient A; of the bivariate Cg. 
c) Show that Cg converges to the independence copula as 0 — 0. 


d) Show that Cg converges to the comonotonicity copula as 0 — oc. 


Exercise 7.18 (Outer power Archimedean copula) 

Let Y be an Archimedean generator which generates an Archimedean copula C. The correspond- 
ing outer power Archimedean copula is the Archimedean copula generated by (t) = w(t!/?), 
B € [1,00). This provides a convenient method for introducing an extra parameter into an 
Archimedean copula and can give more flexibility for modelling tail dependencies in both upper 
and lower tails. 


a) Show that Kendall's tau of the outer power Archimedean copula generated by ij is given by 


pr —1-(1— p:)/8, 


where p, denotes Kendall’s tau of the Archimedean copula C generated by wv. 


b) Show that the coefficients of lower and upper tail dependence of the outer power Archimedean 
copula generated by w are given by 
!(91/8 !(91/8 
=e ai, Ee i a2 A P 2 79 
Ay = 2 jim. "O and Ay=2-2 lim wy” 


respectively. 

c) Let y(t) = (1+ t)-!/?, 0 € (0,00), denote the Clayton generator (with Kendall’s tau 
pr = 0/(0 *- 2)). Compute Kendall’s tau and the coefficients of tail dependence of the outer 
power Clayton copula generated by v(t) = v(t!/?), 8 € [1, 00). 


Exercise 7.19 (Farlie-Gumbel-Morgenstern copula) 
For 0 € [-1, 1], let 


C (ui, ua) = uu» +4 (1 ui)(1 u3)u4ua, u1, U2 € (0, 1]. 


Show that C is a copula. 
Show that C is exchangeable. 
c) Show that C is radially symmetric. 


d) Compute Spearman’s rho for (U1, U2) ~ C. Considering the range of ps, what drawback 
does the Farlie-Gumbel-Morgenstern copula family have for statistical applications? 


e) Compute the coefficient of lower tail dependence Aj. Without further calculation, write 
down the coefficient of upper tail dependence Au. 


Exercise 7.20 (Kendall's tau as a correlation) 
The Kendall's tau of two continuously distributed random variables X1, X2 can be shown to be 
equal to 


pr(X1, X2) = AP(X4 < X1, X < X1) - 1, 
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where (X1, X5) denotes an independent copy of (X1, X2); see MFE (2015, equation (7.29)). 
Show that Kendall's tau can be expressed in terms of Pearson's correlation coefficient as 


pr(X1, X2) = P(X € X3), I(Xo < X2)). 


Sol. p. 209 Exercise 7.21 (Blomqvist's beta) 
Blomquist/s beta is a further copula-based measure of dependence. For random variables X, 
and X» with continuous distribution functions P, and F> it is defined by 


pg (Xi, X2) = P((X1 — mi)(X2 — ma) > 0) — P((X1 — m1)(X2 — ma) < 0), 


where m; = Ff (1/2), j € {1,2}. 
a) Show that pa(.X1, X2) can be expressed in terms of the copula C of (X1, X5) alone. 
b) Show that pa( X1, X2) can be expressed as 


C(1/2,1/2) - (1/2, 1/2) 
M(1/2,1/2) -H1/2,1/2)' 


pa X3, X2) = 


where IT denotes the independence copula and M the comonotonicity copula. 


c) Show that pg(X1, X2) can be expressed in terms of Pearson’s correlation coefficient as 
pa(X1, X2) = p(1(X1 < m), I(X2 < mz)). 


Sol. p. 210 Exercise 7.22 (Convex combinations of copulas) 
Let C1 and C» be two d-dimensional copulas and define, for y € [0, 1], 


C(u) = 4C4(u) + (1 —y)Co(u), we [0,1]7. 


Taking convex combinations in this manner provides a further way of expanding the set of 

copulas available to risk modellers. 

a) Show that C is a copula by verifying the three characterizing properties of a copula in MFE 
(2015, Section 7.1.1). 

b) Suppose that we can generate random vectors U} ~ C1 and U2 ~ C5. By recognizing that 
C is a mixture model, write down an algorithm for generating a random vector U ~ C and 
verify its correctness. 

c) For d= 2, show that the Spearman's rho pg of C is given by ps = yps,1 + (1 — 7) ps,2, where 
ps,; denotes the Spearman's rho of Ck, k € (1,2). 


d) For d = 2, show that the Kendall’s tau p; of C is given by 


pr = Pra + 23(1 — y) (4E(C1(U2)) — 1) + (1 — 9)* pra; 


where pr, denotes the Kendall’s tau Ck, k € {1,2}. 
Hint. You may use the fact that E(C2(U1)) = E(C4(U3)). 
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Exercise 7.23 (Bivariate Marshall-Olkin copulas) 
Common shock models have applications in insurance modelling and credit risk modelling. In 
the latter context consider three firms and for j € {1,2,3} let T; denote the time to bankruptcy 
for firm j due to reasons that are exclusive to each firm (for example bad management). Assume 
moreover that firms 1 and 2 are suppliers to firm 3 and will both fail immediately if firm 3 fails 
first. 

Let Tj ~ Exp(Aj), j € {1,2,3} and assume that T1, T2 and T3 are independent. The actual 
times to bankruptcy for firms 1 and 2 are given by 


Xj = min{T;, T3}, Vie {1,2}. 
The survival copula of (X1, X2) is a Marshall-Olkin copula; see Marshall and Olkin (1967). 
a) Show that the survival copula of (X1, X2) is given by 


C (u1, u2) = min(ujul ??, up uo}, U1, U2 € [0,1], 


where a; = A3/(Aj + A3), j € {1,2}; note that one typically includes the boundary cases 
oj € {0,1}, j € {1, 2}, in this representation. 
b) Let (U1,U5) ~ C. Find a stochastic representation of the form (U;,U2) = (fi(V1, Va), 
ind. 


f2( V5, V3)) for Vi, V2, V3 ^ U(0,1) and functions fı and fə that you should determine. 
c) Compute the tail dependence coefficients Aj, Au for (U1, U2) ~ C and any 01,02 € [0, 1]. 


Exercise 7.24 (Sampling copulas and meta-distributions) 


Use statistical software to reproduce and extend the analyses in MFE (2015, Example 7.13). 


In other words consider the following four bivariate copulas: 
i) the Gauss copula with parameter p — 0.7; 

ii) the Gumbel copula with parameter 0 — 2; 

iii) the Clayton copula with parameter 0 — 2.2; and 

iv) the t copula with parameters v = 4 and p = 0.71. 


a) Generate random samples of size 2000 from the four copulas and produce scatterplots. 


b) Generate and plot samples from meta-distributions based on the four copulas above combined 
with standard normal margins. 


c) The copula parameters were chosen to give meta-distributions with a linear correlation of 
approximately 7096. Now try to generate samples from meta-distributions which combine 
the four copulas above with unit exponential margins. Change the copula parameters for 
the Gumbel, Clayton and t copulas so that they all have the same value for Kendall’s tau 
as the Gauss copula. 


The R package copula can be used to answer this question. 
Exercise 7.25 (Fitting copulas to equity return data) 
Take the same data that were analysed in Exercise 6.23, in other words daily log-return data 


from 2005 to 2012 for the following 10 components of the Dow Jones index: Apple, Cisco, 
Disney, IBM, Intel, McDonald's, Microsoft, Nike, Proctor & Gamble, and Walmart. 
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a) Fit both the Gauss copula and the t copula to the log-returns of these data using the pseudo 
maximum-likelihood approach to take care of the unknown marginal distributions. What 
are the fitted copula parameters? 


b) Compare the Akaike information criteria (AIC numbers) for the two models and confirm 
that the £ copula is clearly the superior model for the dependence structure. 


c) Repeat the analysis for monthly log-return data. 


The R package copula contains fitting functions for these copulas. 


Advanced 


Exercise 7.26 (The conditional copula of a Clayton copula) 

Let (U1, U2, U3) be distributed according to a Clayton copula with parameter 0 > 0. For 
u3 € (0,1), derive the copula of (U1, U2) | Us = us, that is the conditional copula of (U1, U2) 
given that U3 = us. 


Exercise 7.27 (A flexible bivariate copula construction) 
In this exercise we study a flexible copula construction that can be used to build copulas with 
a variety of different properties. 

Let gi, g2 : [0,1] — [0,1] be continuously differentiable functions satisfying g;(0) = g;(1) = 0, 
j € (1,2), and gj (ui)95(u2) > —1 for all u1, u2 € [0,1]. Consider 


Cui, u2) = uru + gi(ur)ge(u2), U1, ua € [0,1]. 


) Show that C defines a copula. 
) Compute p = p(U1, U2) = ps(U1, U2) for (U1, U2) ~ C. 
) Show that p = 0 if g1 or g2 are point symmetric about 1/2 (meaning that g;(u) = —gi(1— u)). 


o c & 


e 


) Hence construct a parametric copula family satisfying p — 0 for all its members. 


) Show that C is radially symmetric if gı and g2 are symmetric about 1/2. 


m 


) Hence construct a copula that is radially symmetric but non-exchangeable. 


Exercise 7.28 (The Liebscher construction) 
Let C4, and C2 be two copulas and (U1, Vi) ~ C4 and (U2, V5) ~ C2 two independent pairs of 
random variables. For a, 8 € (0, 1), define 


Kes mary, UI 
i= max{ V’, yT, 


a) Show that the copula of (X1, X2) is given by 
a PB l-a , 1-8 
C (u1, u2) = Ci(ut, ug )Co(uy ^,u5 ^), uu» € [0,1]. 


b) Compute the four limiting copulas as (a, 3) tends to (0,0), (0,1), (1,0) and (1,1). 
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c) For a= £, compute the coefficient of lower tail dependence A, of C in terms of the coefficient 
of lower tail dependence Ai; of Cj, j € {1,2}. 


Exercise 7.29 (The Rosenblatt transformation and the conditional sampling method) Sol. p. 216 
For a d-dimensional copula C, U ~ C, j € {2,...,d} and u1,...,uj—1 € (0,1), one can define 
the conditional distribution 


Cin,....j-1(uj | U1,--- pt) = P(U; < uj | Uj—u..., Uj-1 = uj—1), uj € (0, 1], 


of C. It can be shown that if C admits continuous partial derivatives with respect to the first 
j — 1 arguments, then 


j-1 ; 
Bur su CO (us. vus) 
C, -1(u5 | v, ..., uj-1) = — i GRRE EET 5 (E1) 
gg agg C 79 D (ui... 5-1) 
where Cll) (uz, ..., u) = C(wm,...,uj,1,...,1) denotes the marginal copula of C corre- 


sponding to the first 7 arguments (with the convention that C (uz) Ex). 
The transformation Rc : (0, 1)7 — (0, 1)7 defined by Ro(u) = u’ with 


/ 
Uy = U1; 


uz = Coj (uo | ui), 


u4 = Colina d=i Ua [1154 1); 


is known as the Rosenblatt transformation (Rosenblatt (1952)). If U ~ C, it can be shown 
that Rc(U) follows the independence copula II and this fact is used in goodness-of-fit tests; 
see Genest et al. (2009). 

Consider the corresponding inverse transformation RẸ : (0, 1) — (0, 1)? defined by RẸ (uw) = 
u with 


/ 
uj = Uu, 


ug = Có (uz | ur), 


Ug = Cán. a1 (va | u1,..- pei) 


where Saj- denotes the quantile function of uj — Cjn, j-1(uj |ui,---,uj—1). One can 


show that if U’ ~ II, then R&(U’) ~ C. This inverse Rosenblatt transformation can be used 
for sampling from C and the corresponding algorithm is known as the conditional distribution 
method. 

a) Compute Cji... j-1 and Cin ed for an Archimedean copula with d-times continuously 


differentiable generator 7 satisfying w(t) > 0 for all t € [0, oc). 
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b) Compute C... and C sea for the Clayton copula. 


Exercise 7.30 (Archimedean Marshall-Olkin copulas) 
Let (W1, W2, W3) ~ Co where Co is an Archimedean copula with generator vo(t). Let 


X; = min(T;, Ti2}, j € {1,2}, 


where (Ti, T2, Ti2) = (— log W1, — log W2, — log W3) . Show that the survival copula C of 
(X1, X2) is given by 


C(u, u2) = Co(Go(u1), Go(ua), Go(min(u;, u2})), u1, U2 E (0, 1], 
where Go(u) = polg 1 (u)/2), u € [0,1]. 


Exercise 7.31 (Axioms of concordance) 

Spearman's rho and Kendall’s tau are measures of concordance. According to Scarsini (1984), 
such measures should fulfill a set of desirable axioms. Let (X,Y) be a pair of random variables 
with continuous marginal distribution functions and joint distribution function F. Let the 
measure of concordance be &(F) = &(X,Y). The axioms are: 


i) (Domain) &(X, Y) is defined for every such pairs (X,Y); 

ii) (Range) —1 € k € 1 with &(X,Y) = —1 for countermonotonic random variables and 
K(X, Y) = 1 for comonotonic random variables; 

ii) (Symmetry) &(X, Y) = «(Y, X); 

iv) (Independence) If X and Y are independent, then &(.X,Y) = 0; 

v) (Change of sign) K(—X,Y) = —&(X,Y); 

vi) (Coherence) If (X1, Yi) and (X2, Y2) are pairs of random variables with copulas C1 € C» 
ordered pointwise, then &(.X1, Y1) € K(Xa, Yo); 

vii) (Continuity) If (F5,),ew is a sequence of continuous bivariate distribution functions which 
converges pointwise to the continuous distribution function F, then limp. K(Fn) = &(F). 

a) Verify these axioms for Spearman's rho. 

b) Provide an example which shows that &( X, Y) = 0 does not imply independence of X,Y. 


Hint. Consider the uniform distribution on the unit circle. 
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Exercise 8.1 (Acceptance sets) 
Let o be a monotone and translation-invariant risk measure. 


a) Provide a mathematical definition of the acceptance set A, corresponding to o. 
b) How are convexity and coherence of o related to the geometric properties of A,. 
c) Describe A, for o = VaRa, o € (0, 1). 


Exercise 8.2 (Dual representation of ES) 
a) State the dual representation of expected shortfall ES4, o € [0, 1). 
b) By definition, ES, is increasing in a. How can this be seen from the dual representation? 


c) Use the dual representation to show that ES, is subadditive. 


Exercise 8.3 (Distortion) 
a) Give the definition of a distortion risk measure. 
b) Explain how distortion risk measures can be represented in terms of expected shortfall. 


c) Use this representation to show that distortion risk measures are comonotone additive and 
coherent. 


Exercise 8.4 (Subadditivity of VaR for jointly normal risks) 
Suppose X = (X4,..., X3) ~ Na(p, X) and let M denote the space of all risks of the form 


L=XX = 3mm Aj Xj. Show by calculation that VaR is subadditive on M for all a € [1/2, 1). 


Exercise 8.5 (Maximal ES of the sum) 
Let Lı  F; and Lə ~ Fə with finite second moments. Explain whether or not ES,(L1 + L3) 
is maximized when the correlation between Lı and Lə is maximal. 


Exercise 8.6 (Revisiting Markowitz) 
Consider linear portfolios in the set 


M={L:L=NX, AER}, (E1) 


where X is a fixed d-dimensional random vector of risk factors defined on some probability 
space (Q, F, IP), and where A represents portfolio weights. Denote by L(A) = 2 95 Aj; X; the 
loss generated by the portfolio weights A and assume that X has an elliptical distribution with 
finite variance. Fix some o > 0.5 and suppose that VaR, is used as a risk measure. 
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8 Aggregate Risk 


Consider the problem of finding the VaR,-minimizing linear portfolio in the set of all 
portfolios whose expected loss is equal to a given constant c, that is consider the problem 


min VaRa(L(A)) such that E(L(A)) = c. 
AER? 


Explain why the solution A* of this problem is identical to the classical Markowitz portfolio, 
that is to the solution of the problem miny¢pa var(L(A)) such that E(L(A)) = c. 


Exercise 8.7 (On correlation adjusted summation in risk aggregation) 
Let EC;,..., EC; denote (economic) capital amounts for d business lines of a financial firm 
with associated losses Li,..., L4 and let L = 214 Lj denote the total loss. 


a) Discuss strengths and weaknesses of the simple summation formula 
d 
EC = V EC; 


j=l 


and of the correlation adjusted summation formula 


d d 
EC = 5 5 Pij EC; EC;, 


i=1 j=l 


for risk aggregation; here pi; € [0,1], 1 < i, j € d, are parameters with py = 1 and pij = pji 
that are usually referred to as ‘correlations’. 

b) If risk is measured with o = ES,, is simple summation always conservative in the sense of 
providing an upper bound for o(L)? What about o = VaRq? 


c) Take d = 2 and suppose that Lj is lognormally distributed with parameters uj and cj, 
j € (1,2). Suppose that the firm decides to use correlation adjusted summation with some 
parameter p = 012 € (0, 1]. Does the interpretation of p as correlation of Lı and Lz always 
make sense? 


Exercise 8.8 (Economic properties of a capital allocation principle) 

Consider an insurance company with d business lines, producing random losses L1,..., La, 

respectively, so that the total loss is L = Ya Lj. Let o be a positive-homogenous risk measure 

and let o(L) be the risk capital for the entire company. In this context, a capital allocation 
principle allocates the economic capital amounts AC,,...,AC 4 to the respective business lines. 

a) Explain briefly why capital allocation principles are used in risk adjusted performance 
measurement. 

b) Give the definition of the Euler capital allocation principle in the above context and explain 
why in the above context RORAC-compatibility and the existence of a diversification benefit 
(two properties of the Euler capital allocation principle) are desirable properties of a capital 
allocation principle. 
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Exercise 8.9 (Acceptance sets for generalized scenarios) 
Let Q = {w1, w2} and let Q1, Qo be probability measures on Q such that Qı ({w1}) = 0.6 and 


Q2({w1}) = 0.3. Consider the generalized scenario risk measure o(L) = max{E®1(L), E®2(L)}. 


By identifying a random variable L with a vector | = (L(w1), L(w2)) we can identify the 
acceptance set of o(L) with a subset of R?. Describe this set. 

Exercise 8.10 (Convexity of coherent risk measures) 

a) Show that every coherent risk measure o : M — R is a convex risk measure, that is o 


satisfies 


pO EL m AES <li = ot Pays. ee [0,3] E Lace ME: 


b) Show that for positive-homogeneous risk measures convexity and coherence are equivalent. 


c) Show by means of a counterexample that without positive homogeneity the statement in b) 
is false. 


Exercise 8.11 (Standard deviation principle as a risk measure) 
For a random variable L with finite second moment, suppose that we define the risk measure 


o(L) = E(L) + ksd(L) where k is a positive constant and sd denotes standard deviation. 


'This risk measure is known as standard deviation principle. Show that this risk measure is 
translation invariant, subadditive and positive-homogeneous, but not monotone in general. 
Hint. For a counterexample for monotonicity let L4 — 0 and suppose that, for sufficiently large 
n, 


es 0 | with probability 1 — 1/n, 
? l-1 with probability 1/n. 


Exercise 8.12 (Standardized and strictly convex risk measures) 
Consider a strictly convex risk measure o that is standardized so that o(0) = 0. Show that for 
Aà > 1 it holds that o(AL) > Ao(L) and give an economic interpretation of this inequality. 


Hint. A convex risk measure o is strictly convex if for Lı and Ly in the domain of o with 
Li # Lə and A € (0,1) one has o( MA +(1- X)L2) < Ao(L1) + (1 — A)o(L2). 


Exercise 8.13 (Monotonicity of expectiles in the confidence level) 
Show that the a-expectile e4(L) of a loss L with E(|L|) < oo is increasing in the confidence 
level a. 


Exercise 8.14 (Expectiles of the standard uniform distribution) 
a) Show that the a-expectile e4(L) of a loss L with E(|L|) < oo satisfies 


e (L) = E(L) + “OECO - y)*) = EC) + EU - 9)), 
where xt = max{z,0} and z^ = max{—z, 0}. 
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8 Aggregate Risk 


b) Now assume that L ~ U(0, 1). 

i) Compute e4(L) using a). 

ii) Compute e4(L) using MFE (2015, Proposition 8.23). 
c) What is ea(L) in the case where L ~ U(a,b) for a < b. 


Exercise 8.15 (Expectiles for exponential and Pareto risks) 

a) Compute e4(L) for L ~ Exp(A), A > 0. 
Hint. You may use the Lambert W function, defined as the inverse of the function z > 
z exp(x). 

b) Compute e4(L) for L ~ Pa(2, x), k > 0. 


Exercise 8.16 (Stress test risk measure) 

Consider a set of potential losses of a financial institution which are of the form L = l(a) 
where x denotes a d-dimensional vector of risk-factor changes and where / is the risk mapping 
associated with the loss L. Fix a set S C IR? of possible risk-factor changes (the scenarios). A 
stress test risk measure is then given by 


os(L) = sup(!(z) : æ € S}, 
that is the worst loss over all scenarios which belong to S. Show that os(L) is coherent. 


Exercise 8.17 (Euler principle for multivariate normal risks) 
Assume that (L4,..., La) ^ N4(0, £). For the risk measure o = ES, compute the Euler capital 
allocations AC;, the so-called expected shortfall contributions. 


Hint. You may use the fact that ES,(L) = w+ c HE) for L ~ N(u, 0). 


Advanced 


Exercise 8.18 (Dual representation of convex risk measures) 

This exercise is concerned with the computations in MFE (2015, Example 8.12) which illustrates 
the proof of the existence of a dual representation for a convex risk measure. Consider (Q, F, P) 
with Q = (w1, w2}, F the power set of Q and P({w,}) = 0.5 for k € {1,2}. Consider the convex 
risk measure o(L) = logE(e”). For a loss L satisfying o(L) = 0 construct the probability 


measure Qr, in MFE (2015, Theorem 8.11, Step 2). 
Hint. Show that Ap = ((£1, £2) : l2 € log(2 — e^)). 


Exercise 8.19 (Representation of ES) 
For an integrable loss L with distribution function FT, expected shortfall may be written as 
E(LItrsgr(a))) + Ff (0)(1 — o — Fr(Ff (o))) 


ES,(L) = = (E1) 


see MFE (2015, Proposition 8.13). In this exercise we give several related representations for 
ESa. 
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a) Define the tail value-at-risk (TVaR.) (also known as conditional tail expectation (CTE)) at 
confidence level a € (0,1) by 


TVaR4(L) = E(L| L > VaRa(L)). 


Show that 


ES, (1) = TVaRs (L) EEEO) 5 var, (2) (.- > 


and interpret this formula. 
b) Let 


ae o ER P(L =q) =0, 
{L>q} Titsq + TAA Ira), P(L = q) > 0. 


Show that ES, allows for the representation 


er 
(LI. re (ay) 


l-a 


ESS(L) = 


Exercise 8.20 (CoVaR and systemic risk) 

Let Lı and Lz be two continuously distributed losses interpreted as either losses of two firms 
in a financial system or of one firm and the financial system as a whole. The conditional 
value-at-risk CoVaR of Lə given L4 at confidence level o € (0,1) is a risk measure defined by 


CoVaRo,(L2 | L1) = Figjr4>varg(L1)(@); 


where Fy.)7,>VaR,g(L1)() = P(Le € z| L1 > VaRg(L1)). The systemic risk contribution of Ly 
to Lə at confidence level a € (0,1) is defined by 


A CoVaR, (La | Li) = CoVaRo,a(La | Li) — CoVaRa,o.5(L2 | Li). 


a) Interpret CoVaR, (L3 | L1) and A CoVaRa(L» | L1). 

If (L1, L2) ~ F with margins F1, F2 and copula C, find an expression for Pita vetus ia): 

c) Derive an expression for CoVaRg,3(L2|L1) for C(u) = yM(u) + (1 — 7) (u), u € [0, 1], 
where M and II denote the comonotonicity and the independence copulas and y € [0, 1]. 
Comment on the special cases y € {0,1}. 


S 


d) Under the setup of c) and for L2 ~ Exp(A), A > 0, compute the systemic risk contribution 
A CoVaRA(L» | L1), a € (1/2, 1). Comment on the special cases y € {0,1}. 

Note. CoVaR and A CoVaR were introduced by Adrian and Brunnermeier (2016) but with 

conditioning event Lı = VaRg(L) instead of Lı > VaRg(Li). The former conditioning event 

can also be found in the definition of CoVaR in other literature. Mainik and Schaanning (2014) 

give reasons why the conditioning event Lı > VaRg(L1) is to be preferred. 
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Exercise 8.21 (Superadditivity of VaR under worst-case dependence) 
For o € (0,1) and L4 ~ U(0,1) let Lə be almost surely given by 


Ii, Li <a, 
fj 
1+a-— Lı, Li 2a. 


a) Show that Lə ~ U(0, 1). 
b) Let a € (0,1) and £ € (0, (1 — a)/2). Compute VaRa+e(Lı + L2) and compare it to 
VaRa+e(L1) + VaRa+e(L2). 


c) For e € (0, (1 — a)/2) compute the upper bound VaRa+e(S2) from MFE (2015, Proposi- 
tion 8.31) and relate this to the result of b). 


Exercise 8.22 (A proof of subadditivity of ES based on H6ffding’s Lemma) 
Let L be a random variable with E(|L|) < oo and recall that L admits the stochastic represen- 
tation L = Ff (U) where U ~ U(0,1) and Fy, is the distribution function of L. 


a) By considering Y = Iyysq} for o € (0,1), show the validity of the representation 


sup E(LY) : Y  B(1,1— a)}, 


ESS(L) = 
alL) l-a 
where the supremum is understood as being taken over all joint distributions of (L,Y) such 
that L has marginal distribution function Fr, and Y ~ B(1,1 — o). 


b) Conclude that expected shortfall is subadditive. 


Exercise 8.23 (A basic version of the rearrangement algorithm) 

Let Lı, Lo and La be losses from three business lines of a financial firm with continuous 
distribution functions F1, F> and F3 respectively. Assume the firm measures risk with VaR4(L) 
based on the total loss L = Lı + La + L3 for some confidence level o € (0,1). The following 
questions address elementary steps in understanding the rearrangement algorithm of Embrechts 
et al. (2013a) for computing lower and upper bounds on the worst value-at-risk VaRa(L), 
that is the largest VaR4(L) under Fi, F5, F3. For simplicity, the basic version below describes 
the necessary steps to obtain one such bound on VaR4(L) and we take it as an estimate for 
VaRa(L). 


a) Suppose the matrix 


wre A N 
NIH w 
Kw OO 


contains in column j € {1,2,3} the largest (1 — a)100% of the losses incurred in business 
line j. The vector of row sums (13,6,8, 14) of A can thus be viewed as realizations of the 
largest (1 — a)100% of the total loss L and the minimal number, 6, is thus an approximate 
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VaRa(L) estimate, denoted by VaRa(L). By rearranging the losses in each column of A, 
we can mimic what happens to VaRq(Z) under different types of copulas C of (L1, L2, L3) 
in the joint tail. 

Rearrange the losses in each column of A such that the rows of the rearranged matrix 
A mimic realizations obtained under the comonotonicity copula C = M. What is the 


corresponding VaR4(L) estimate VaRa( L)? How would you need to rearrange each column 
of A if C equals the independence copula II? 


An estimate VaR4(L) of worst value-at-risk VaRa(L) over all copulas C of (L1, Lo, L3) in 
the joint tail can be constructed by rearranging the entries in each column of A such that 
the minimum of the row sums of the rearranged matrix becomes maximal (see iii) below for 
an explanation). To rearrange A, iterate over the columns of A and oppositely order the 
current column with respect to the sum over all other columns; note that two vectors a 
and b are oppositely ordered if the smallest element of a lies next to the largest element 
of b, the second-smallest next to the second-largest and so on. Assume this basic version 
of the so-called rearrangement algorithm (RA) is terminated if every column is oppositely 
ordered with respect to the sum over all other columns. 


— 


i) Apply this basic version of the RA to A for computing the estimate VaRa(L) of worst 
value-at-risk VaRq(ZL). 
Hint. When rearranging column j, it is possible that the vector of row sums over all 
other columns contains ties, that is equal values. In this case, you are free to choose an 
order when rearranging the corresponding entries in column 7. 


ii) Compare your result with the case C = M from a). 


iii) Explain intuitively why each step of the RA can increase VaR4(L) and thus provide a 
way to approximate VaRa(L). 


Hint. Consider the case where all the losses are continuously distributed with finite first 
moment and suppose it is possible to rearrange the columns of A such that all row sums 
are (nearly) equal. 


Apply the basic RA as described in b) to the matrix 
111 
A= 22.72 
3.33 


for computing the estimate VaRa(L) of VaR4(L). Is the VaRa(L) obtained this way indeed 
the largest possible VaR4(L) estimate in this case? 
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Review 


Exercise 9.1 (Mapping and loss operator) 
Briefly explain in words what is meant by the mapping of a portfolio and explain the concept 
of a loss operator. 


Exercise 9.2 (Unconditional and conditional approaches to market risk measurement) 
Briefly explain the ideas behind the unconditional and conditional approaches to market risk 
measurement and summarize the pros and cons of each approach. 


Exercise 9.3 (Comparison of variance—covariance, historical simulation and Monte Carlo) 
List some advantages and disadvantages of the variance-covariance, historical simulation and 
Monte Carlo methods for measuring market risk. 


Exercise 9.4 (Historical simulation) 


a) Give a theoretical justification for the basic historical simulation method without volatility 
estimation. 


b) Explain why this is an unconditional approach. 


c) Sketch a method of extending historical simulation to a time-series context where volatility 
is changing. 


Exercise 9.5 (Elicitability and its relevance to model validation) 


a) Briefly explain the notion of elicitability for a law-invariant risk measure ọ. 


b) Explain why elicitability of a risk measure o is useful for comparing risk measure forecasts. 


c) Categorize the following risk measures according to whether they are elicitable or not: 


value-at-risk, expected shortfall, the expectile risk measure. 


Exercise 9.6 (VaR violations) 
a) Explain the concept of a violation of value-at-risk at level a. 


b) What would be the properties of the time series of VaRq violations produced by an ideal 
forecaster who always uses the correct conditional loss distribution to forecast losses? 


Exercise 9.7 (Factor models for bond portfolios) 


Describe the main differences between the Nelson-Siegel and PCA methods of building factor 
models for measuring the risk in a bond portfolio. 
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Exercise 9.8 (Mapping an interest-rate swap) 
A (forward-start) interest-rate swap (IRS) is a contract with future start date Ty where two 
parties agree to exchange payment streams at a set of times Tı <--- < T, with Ti > To. One 
party receives the amount Nó; K at time T; where N is the nominal value of the contract, K is 
a fixed interest rate and 6; is the length of the interval [T;..1, T;] expressed as a fraction of a 
year; in return the other party makes the floating payment Nó;L(T; 1, T;) where L(T; 4,T;) 
is the simply compounded spot interest rate at time 7; for the maturity time T; (this is 
typically the LIBOR rate). For simplicity we consider a nominal amount N — 1 and regular 
quarterly payments such that ó; — 1/4 for all i. 

Suppose the contract is agreed at time t = 0. The value of the contract at time 0 € t « To 
to the party receiving the fixed payments is given by the formula 


n 


V(t) = -p(t, To) + plt, Ta) + 9 vt Ti), 
i=1 


where p(t, T) denotes the price of a zero-coupon bond with maturity T at time t; see Brigo 
and Mercurio (2006, Section 1.5). 


a) How is K determined if the contract should be fair when it is agreed upon? 


b) Derive the mapping for the interest-rate swap in the discrete-time form V; = g(™, Zt) where 
Z, are risk factors which you should identify, 7; = t(At) and At is the length of the risk 
management time horizon. 


c) Hence find the linearized loss operator for the interest-rate swap. 


Exercise 9.9 (Mapping a currency forward) 

Denote by (e+) the exchange rate between two countries (foreign and domestic), that is at time 
T one unit of the foreign currency can be exchanged for er units of the domestic currency. As 
a concrete example, assume that e; is the USD/EUR exchange rate, so that the foreign bond is 
a US treasury bond and the domestic bond a Euro area bond. A currency or FX forward is an 
agreement between two parties to buy/sell a prespecified amount N of USD at a future time 
point T > t for a prespecified exchange rate € (and not at the market rate er). The future 
buyer is said to hold a long position, the other party is said to hold a short position in the 
contract. 


a) Explain why the value of the contract at maturity T is equal to Vr = N(er — e) (from 
the viewpoint of the party holding a long position), and show that a long position in the 
forward is equal to a portfolio of N US treasury bonds p'(t, T) and —Ne units of the Euro 
area zero coupon bond p*(t, T). 

b) Derive the mapping for the currency forward using as risk factors the yield to maturity of 
the Euro area and of the US treasury bond and the logarithmic exchange rate. 


c) Compute the linearized loss operator for the mapping derived in b). 
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Exercise 9.10 (Asset liability management using duration and convexity) 

Consider a pension fund which makes payments of 1M at the end of each of the next five years, 
that is for T € T = {1,2,3,4,5}; the current time is t = 0. Assume that the yield curve is flat 
with yield to maturity y(0, T) = 1% for T € T. Suppose that the capital of the fund today is 
exactly equal to the present value of the liabilities. The fund manager compares the following 
two strategies: 


1) Perfect matching: Here the manager buys for each maturity T € T zero-coupon bonds with 
nominal value 1M. 


2) Immunization with maximal convexity: Here the manager invests only in zero bonds with 
maturity 1 and 5. The nominal values A; and A5 are determined from the condition that 
in t = 0 the value of the assets should be equal to the value of the liabilities and that the 
duration of the cash flows on the asset side should be equal to the duration of the cash 
flows on the liability side. 


a) Use numerical root finding to show that Ay + 2.48M and A5 e 2.53M for Strategy 2. 
b) Compute the convexity of both strategies. 


c) Compute for both strategies the change in the overall value of the position (assets — 
liabilities) if i) all yields rise to 2% and ii) all yields drop to 0.1%. 


d) In view of your results in b) and c), comment on the claim that Strategy 2 (which has 
higher convexity) outperforms Strategy 1. 


Exercise 9.11 (Variance-covariance method for multivariate t risk-factor changes) 
Suppose that the vector of risk factor changes satisfies X = (X1,..., Xa) ~ ta(v, p, X). 


a) Explain why this might be a better model for market risk than a multivariate normal 
distribution. 


b) Suppose that the linearized loss L^ = Ao + AX is a sufficiently accurate approximation of 
the true loss. Use Exercises 6.20 a) and 2.16 to derive formulas for VaR4(Ao + A X) and, 


for v > 1, ESg (Ao + A X). 


c) Use b) to calculate VaRo.99(L2,) and ESo.99(L2) for the linearized portfolio loss over a 
one-day horizon for a portfolio of two stocks with portfolio weights w; = 0.7 and w = 0.3 
and a value today of V; = 1 (in M EUR). You should assume that the log-returns are 
bivariate t distributed with annualized volatilities c4 = 0.2, o2 = 0.25, correlation p = 0.4 
and degrees of freedom v — 5. Assume that the expected returns are 0 and that one 
year consists of 250 trading days. Use the square-root-of-time scaling to compute daily 
volatilities. 


YS 


d) Compare your results with Exercise 2.17 and verify that they are consistent if you allow 
the degree of freedom to tend to infinity. 
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Exercise 9.12 (Sampling from an empirical distribution function) 
Let L1,..., L4 denote realizations of L ~ F and let 


1 
F,,(x) = - S Magos r cR, 
i=1 


be the empirical distribution function based on L1,..., Ln. Show that sampling from Fn 
corresponds to randomly drawing from L4,..., Ln with replacement. 


Exercise 9.13 (Square-root-of-time rule for serially independent risks) 


a) Assume the losses for the next h days can be modelled as independent L, ~ N(u, 07), 
t € {1,...,h}. Show that the total loss Lı +---+ Lp satisfies the rule 


VaRa(Li +--+ + Ln) = (h — Vh)u + Vh VaRa(I4) 


which, for u = 0, is known as the square-root-of-time rule. 
Note. In MFE (2015, Example 9.4) a multivariate version of this result based on independent 
multivariate normal random vectors is given. 

b) Suppose you hold 100 shares of a stock whose daily returns are independent normal 
distributed with mean 0.01 and standard deviation 0.2. What is VaR for a 9-day time 
horizon? 

c) Consider independent losses L4,..., Lp from a distribution with mean u € R and variance 
o? < oo. Show that for h large, 


VaRo (Li +--+ Ln) & hut V ho! (o). 


Exercise 9.14 (Square-root-of-time rule for serially dependent, jointly normal risks) 
Assume the losses for the next h days can be jointly modeled as (L4,..., Ln) ~ Na (H, X), where 
u = u1 € R for u € Rand X =o? P for a correlation matrix P with equal off-diagonal entries 
p € [0,1]. Show that the total loss Lı + ---+ Ln satisfies the rule 


VaRo(Li +-+ Ln) = (h — yA + (h — 1)0) Ju + V/A + (h — 1p) VaRa (Ia). 


Note. For p = 0 (independent L1,..., Ln), one obtains the (square-root-of-time) rule as in 
Exercise 9.13. For p = 1, we obtain VaRa(L1 +--:+ Lar) = h VaRo(L1) which is clear since 
VaR is comonotone additive. 


Exercise 9.15 (Tests based on VaR violations) 

Suppose you are a regulator trying to assess the quality of the value-at-risk system used by a 
bank. The bank has estimated the one-day VaRo.99 of its trading book every day for two years 
(500 observations). The system has produced 14 VaR0o.99 violations on days 7, 61, 70, 75, 90, 
130, 200, 245, 367, 371, 385, 403, 406 and 487. 
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a) How many VaRo.99 violations would you expect in this period if the bank has a very 
high-quality forecast model for its trading losses? 


b) Carry out a binomial test of the quality of the bank’s forecast model and report the result. 


c) Carry out a test of the hypothesis that the violations occur as an iid Bernoulli sequence. 


Exercise 9.16 (Elicitability and comparison of VaR estimates) 

The scoring function for VaRa(L) is given by S2(y,1) = |Iu<y} — o||l — y| where we recall that 
y stands for the forecast and / for the realized loss. 

a) Use software to draw a graph of S2(y,l) as a function of l for y= 1 anda € {0.5,0.9}. Use 
the graph to explain intuitively why the minimizer of y > E(S2(y, L)) for a = 0.9 is larger 
than for a = 0.5. 

Table E.9.1 contains 12 realized one-period losses (generated from a GARCH(1, 1) model 


2 


——t 
with standardized 13.5 innovations) and corresponding VaR,9g predictions VaRo99.n and 


t 

VaR,9o,, based on N(0, 1) innovations and fitted standardized t innovations, respectively. 
Use a suitable test statistic based on elicitability theory to compare the quality of the risk 
measure predictions and state whether the result is as you would expect. 


t 1 2 3 4 5 6 7 8 9 10 11 12 


Lia —0.73 —2.35 1.30 0.05 —0.93 —0.81 —0.06 0.45 3.72 —0.47 1.21 —0.18 
— >t 
VaRoooN 3-45 3.45 3.46 3.46 3.47 3.47 3.47 3.48 3.48 348 349 3.49 
~t 
VaRogog, 3-76 3.73 3.70 3.67 3.64 3.61 3.58 3.56 3.53 3.50 3.47 3.45 


Table E.9.1 Losses from a GARCH(1,1) model with standardized ts; innovations and corre- 


—~t t 
sponding VaRo.99 predictions VaRo 99 ny and VaR 99 , based on N(0, 1) innovations 
and fitted standardized t innovations, respectively. 
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Exercise 9.17 (Delta and delta-gamma approximation) 

'The aim of this exercise is to verify the formulas for delta and delta-gamma approximations in 
MFE (2015, Section 9.1.2). The derivation of the first-order Taylor series approximation (delta 
approximation) and the second-order Taylor series approximation (delta-gamma approximation) 
of the one-period loss random variable 


Lia = —(g(t + At, Z + Xu) — gme, Z)) 


is easier to carry out if one views the mapping g : R, x R? > R as a function of a single vector 

y= (Tt, 2), sO g(t, z) = gly). 

a) Derive the delta approximation Les of L441 by using the first-order Taylor series approxi- 
mation of g(y1) about yo for yo = (Te, z) and yi = (ri + At, z + x). 
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b) Derive the delta-gamma approximation LAN of L441 by using the second-order Taylor series 
approximation of g(y1) about yo for yo = (me, z) and yı = (te + At, z + x). 


Exercise 9.18 (Standard methods for market risk) 

The aim of this exercise is to compare the various statistical methods for market risk management 

discussed in MFE (2015, Section 9.2) in the case of a simple portfolio of two stocks. Consider 

BMW and Siemens (SIE) (which were also analysed in MFE (2015, Section 3.2.1)) from 3 

January 2000 to 31 December 2009; see the dataset EURSTX_const of the R package qrmdata 

for the data. You should work through the following steps: 

a) Load the data and plot the stock prices for the period in question. Also compute and plot 
the log-returns and construct a scatterplot of one series against the other. 

b) Consider a portfolio consisting of BMW and SIE stocks in the ratio 1:10. Write a function 
to evaluate the corresponding loss operator (in other words, a function which takes the 
matrix of risk-factor changes and the portfolio weights and returns the resulting losses). 

c) Implement as many of the following methods as you can: 

i) the variance-covariance method; 

ii) the standard historical simulation method; 

iii) the Monte Carlo method assuming a multivariate t distribution for the log-returns (you 
will need a function to estimate a bivariate t distribution such as fit.mst() in the R 
package QRM); 

iv) the method which uses a GPD approximation to the tail of the distribution of historically 
simulated losses (you will need a function to fit a GPD model to excess losses over a 
threshold such as fit_GPD_MLE() in the R package qrmtools). 

d) Compute VaRo.99 and ESo.99 estimates for each of the methods; for the Monte Carlo method, 
use a sample size of 104. 

e) Plot a histogram of the losses and display the computed VaRo.99 and ESo.99 estimates (for 
example with vertical lines). Briefly compare the values of the estimates. 


Exercise 9.19 (Standard methods for an option position) 
Suppose a bank has a position consisting of a short put option on the S&P 500 index and a 
long position A > 0 in the stock with total value in discrete-time mapping notation given by 


Vz = AS: — PBS (7, Sti T, 01, K, T), 


where PPS denotes the Black-Scholes put option formula with the usual terms (see MFE (2015, 
Example 9.1) for a similar situation with a call option). Suppose the option was sold on the 
4 June 2009 and that this is day t = 0. On this day 7 = 0, the SP500 value is Sọ = 942.46 
and the value of the VIX volatility index is 30.18 (so co = 0.3018). Consider an option with 
maturity T = 5 years and strike K = 1050. Assume that there is a fixed interest rate of r = 1% 
per annum. 

a) Determine A if Vo = 0; a function for calculating the value of a put option can be found in 

the R package qrmtools. 
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b) Taking the log stock price log S; and the volatility e; as the risk factors derive the linear 
and quadratic loss operators expressing changes in the value of the position in terms of 
changes in the value of these risk factors. 

c) Suppose that you have to measure the market risk of the position at the end of the day on 
4 June 2010, one year after the option was sold. On this day we have 7; = 1, 5, = 1064.88 
and o; = 0.3548. The data can be found in the datasets SP500 and VIX of the R package 
qrmdata. Use the period from 14 June 2006 to 4 June 2010 (1001 values). Estimate the 9996 
one-day value-at-risk and expected shortfall for the position using the following methods 
and summarize the differences in the results: 

i) the variance-covariance method; 

ii) the standard historical simulation method with full revaluation of the position; 

iii) the standard historical simulation method with a delta-gamma approximation for the 
change in the value of the position; 


iv) the univariate dynamic historical simulation method described in MFE (2015, Sec- 
tion 9.2.4 (2)) (use the losses from the full revaluation, an ARMA(1,1) model with 
GARCH(1, 1) errors and t innovations, and parametric estimates of value-at-risk and 
expected shortfall based on the fitted t innovations); see the R package rugarch. 


Exercise 9.20 (Elicitability of the exponential risk measure) 
For a > 0 and L with E(exp(aL)) < oo, the exponential risk measure o&*P(L) for the loss 
function /(r) = exp(oz) is given by the solution of the equation (in m) 


g(1 — exp(a(L — m))) =0, (E1) 


which leads to o£*P(L) = + log E(exp(aL)); see MFE (2015, Example 8.8) for details (we consider 
c = 0 for simplicity). Show that og? (L) is elicitable and give a consistent scoring function. 
Hint. For s(m,l) = 1 — exp(a(l — m)), m,l € R, (E1) can be written as E(s(m,1)) = 0. Use 
this to show that a consistent scoring function is given by S(y,1) = ff s(m, l) dm. 


Exercise 9.21 (Accuracy of the empirical quantile estimator) 

In this exercise we will attempt to understand the magnitude of the error when an empirical 

quantile is used to estimate value-at-risk. Let U;,...,U,, ^7 U(0,1) and let the corresponding 

order statistics be denoted Uq) < Ug) € +++ € Un). Let Fx be a continuous and strictly 
increasing distribution function and define X; = FX (U;), i € {1,...,n}, with corresponding 
order statistics Xa) X -+ € Xin) 

a) Let N ~ B(n,p) for n € N and p € [0,1]. Let G(z;a,6) denote the distribution function 
of a Beta(a, b) distribution with parameters a > 0 and b > 0. Show that the distribution 
function of N satisfies Fy(x;n,p) = G(p; |z| +1,n — [x |). 

Note. This is how R evaluates pbinom(x, size = n, prob = p). 

b) Hence show that U(,) has a Beta(k, n — k + 1) distribution. 


c) Derive an expression for Po (a) in terms of Fx and the quantiles of a beta distribution. 
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d) For notational convenience, let qx(u) = Fi (u) and 0, = EU (x), k € {1,...,n}. According 
to Taylor's Theorem, for u € (0,1) close to 6j, it is known that 


qx (u) © qx (605) + dx (0) lu — 00). (E1) 


Find approximations for E(X(,)) and var(X(,) in terms of qx (005), qx (Ox)), k and n. 


Exercise 9.22 (PCA factor model for US zero-coupon bond yields) 
Repeat the analysis of MFE (2015, Example 9.3) for daily US treasury zero-coupon bond yield 
data. Work through the following steps: 


a) Consider daily US treasury ZCB yields with maturities from one to 30 years from 2 January 
2002 to 30 December 2011. 


b) Apply principal component analysis (PCA) to the daily yield changes and determine how 
much of the total variability can be explained by the first three sample principal components. 


c) Plot each of the loadings corresponding to the three largest eigenvalues as a function of 
time and compare with MFE (2015, Figure 9.3). 


d) Plot the time series of the first three sample principal components. 
The data can be found in the dataset ZCB_USD of the R package qrmdata. 
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Exercise 10.1 (Management of counterparty risk) 
Explain the notion of counterparty risk and the challenges arising in the management of this 
risk type. Describe potential strategies for managing counterparty risk. 


Exercise 10.2 (Bond investment under decreasing credit quality) 

Consider a bond investor who expects the credit quality of bond issuer A to decrease. Develop 
a strategy to profit from this view by using CDS contracts with reference entity A and discuss 
the risks of the strategy. 


Exercise 10.3 (Rating transition matrices) 

A bank uses a simple internal rating system in which there are only two ratings, A and B, as 
well as a default state D. Suppose the one-year transition probabilities are given as follows, 
where the (i, 7)th entry denotes the probability of migrating from rating state i to j, with a 
few missing entries: 


A B D 
A 0.80 0.15 ¢ 


B 0.10 ? 0.20 
D ? ? ? 


Complete the table of transition probabilities and calculate the probabilities that A-rated and 
B-rated obligors default over a two-year period. 


Exercise 10.4 (Credit risk in the Merton model) 
a) Explain the modelling of the default of a firm in the Merton model and discuss the 


relationship between equity and debt and European options on the asset value of the firm. 


b) Consider a firm whose asset value follows a geometric Brownian motion with drift wy = 0.1 
and volatility oy = 0.2. Assume that the current (t = 0) value of the assets is equal to 
200, that the nominal value of the liabilities is equal to 100 (all in MEUR) and that the 
maturity of the liabilities is in T = 1 (year). Calculate the default probability of the firm. 


c) Discuss strengths and weaknesses of the Merton model as a typical structural model. 


Consider in particular the properties of credit spreads. 


d) How does the price of the risky debt react to an increase in the volatility oy of the asset 
value of the firm? How is this related to the incentives of bondholders? 
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Exercise 10.5 (Market-based credit risk management and procyclicality) 

Why might the use of a credit risk management system whose main input are market prices 
such as the public-firm EDF model described in MFE (2015, Section 10.3.3) lead to procyclical 
capital requirements and what are the resulting economic problems? 


Exercise 10.6 (Limitations of risk-neutral pricing) 

Explain the pros and cons of risk-neutral pricing for credit products and contrast risk-neutral 
pricing with the more traditional pricing methodology for loans. Which approaches would you 
use for the following products? 


a) A retail loan to a small or medium sized company. 


b) An option that gives the holder the right to enter into a CDS contract on a major corporation 
at a future date for a spread fixed today. 


Exercise 10.7 (Models with stochastic hazard rate) 
Explain why a simple hazard-rate model is not suitable for the computation of VaR for corporate 
bonds. 


Basic 


Exercise 10.8 (Risk-neutral valuation) 

Consider a simple one-period model for the price of a defaultable zero-coupon bond with 
nominal value 1, maturity T = 1 (year) and with deterministic recovery rate 1 — 6 = 0.4. The 
default probability of the bond is 1 — p = 0.01, the risk-free simple interest rate is 0.025, and 
the current (t = 0) price of the bond is p1(0, 1) = 0.961. The price of the bond in T = 1 is thus 
either 1 (in the case of no default) or 0.4 (in the case of default). 


a) Compute the expected payoff of the bond. 
b) Determine the corresponding risk-neutral default probability. 


c) Compute the price and replicating portfolio for a stylized credit default swap with payoff 
I;,<7}, where r denotes the time of default of the bond. 


Exercise 10.9 (Estimating transition probabilities for a discrete-time Markov chain) 

Let S = {0,...,n} be the set of rating states of increasing creditworthiness with 0 representing 
default. For t = 0,...,T — 1, j € S\ {0} and k € S, let Ny; denote the number of companies 
that are rated j at time t and which are still in the rating system at time t + 1, and let Nijk 
denote the number of those companies that are rated k at time t+1. A discrete-time, stationary 
Markov chain is fitted to the data (N;j) and (Nijx). 


Show that the maximum likelihood estimator of the transition probability pj; is given by 
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Hint. Use the fact that conditional on Nj, the numbers migrating to each state k have a 
multinomial distribution. 


Exercise 10.10 (Default probability in Merton’s model) 

In Merton’s model assume that the growth rate of assets wy is positive and that the initial 
asset value Vo exceeds the liability B. Show that the probability of default at the horizon date 
T is an increasing function of the volatility oy. 


Exercise 10.11 (The Pareto survival model) 
The Pareto distribution with shape parameter a and scale parameter &, denoted Pa(a, x), has 
survival function 


a 
Pit) = ( A »s t>0, k&»0, a>0; 
k+t 


provided that a > n, the moments of 7 ~ Pa(a, &) are given by E(7”) = &"n!/ Ii (a — i). 
a) Derive the hazard function y(t) of this distribution. Is the hazard rate increasing or 
decreasing in t? 


b) Show that 7 ~ Pa(a,«) can be considered as a mixture model with mixing variable 
A ~ Ga(a, k) where, given A = A, T has an exponential distribution with parameter A. 


c) Suppose a > 2 and consider an exponential random variable 7 with parameter À = (a—1)/k. 


Show that E(7) = E(7) but var(r) > var(7). 


Note. This illustrates that mixing over A leads to greater variation in the realizations of T. 


Exercise 10.12 (The Gompertz survival model) 
The Gompertz model is widely used by actuaries in mortality modelling. The distribution 
function is given by 


F(t)=1 exp( zie i t>0,a>0, b>0. 


a) Caculate the hazard function and the cumulative hazard function of this distribution 
function. 


b) The Gompertz-Makeham model is an extension of the Gompertz model. If the hazard 
function of the Gompertz distribution is yq(t), the Gompertz-Makeham distribution has 
hazard function yem(t) = ya(t) + c for some constant c > 0. Calculate the distribution 
function of this model. 


Exercise 10.13 (Reduced-form credit spread under different recovery assumptions) 
Suppose that under the risk neutral measure Q, the default time 7 follows a hazard rate model 
with continuous hazard function yẹ. Consider a defaultable zero coupon bond with maturity 
date T and a valuation date t < T such that 7 > t. Derive a formula for the price pi(t, T) and 
the credit spread c(t, T) of a defaultable zero coupon bond in the following recovery models: 
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a) RT (recovery of treasury); 
b) RF (recovery of face value). 


In both cases, give the limit of the spread for the case where T converges to t. 


Exercise 10.14 (Pricing and calibration of a CDS) 

Consider a CDS on firm R with time to maturity T = 5 years and quarterly premium payments. 

Assume that the risk free continuously compounded interest rate is equal to r > 0 and that 

under the risk-neutral measure Q the default time Tp of the reference entity follows a hazard-rate 

model where the hazard rate is constant and equal to 72 > 0. Assume moreover that the 

loss-given-default of the reference entity is given by the constant 6 € (0, 1]. 

a) Compute the value in t = 0 of the CDS cash-flows from the viewpoint of the protection 
seller for a generic annualized CDS spread zx. 

b) Suppose now that the annualized spread of the CDS observed on the market is z* — 42 bp, 
that the loss-given-default is 6 = 0.6 and that r = 0.02. Calibrate a constant hazard model 
for TR under the risk-neutral probability measure Q. Use both the approximate formula 
59 = z* /ó and exact calibration (numerical root finding is required for the latter). 

c) Can the value of 59 computed as in b) be used for computing the VaR of a credit exposure 
towards R such as a loan? 


Exercise 10.15 (Application of the Feynman-Kac formula) 

Use the Feynman-Kac formula, see MFE (2015, Lemma 10.21), to solve the following two 

terminal value problems. In both cases u € R and o > 0 are given constants. 

a) fit, v) ufu(t; v) Lo? fv (t, V) = 0, (t, v) € [0, T) x R, with terminal condition f(T, V) = 
y. 

b) filt, Y)+uv fylt, v) 3o?v? fyylt, v) — rf(t,v), (t, Y) € [0, D) xIR, with terminal condition 
F(T, v) = log v. 


Exercise 10.16 (Vasicek model) 
Consider the stochastic differential equation (SDE) for an Ornstein-Uhlenbeck process, 


dy, = a(b — v) dt--ocdW;, a,b>0. 


This SDE is sometimes used as a model for the short-rate of interest known as the Vasicek 
model. Define the function F(t, Y) = E(exp(— D Vs ds) | v: = v), the price of a default-free 
zero coupon bond. Conjecture that F(t, Y) = exp(a(t, T) + B(t, T)v) and derive a system of 
ordinary differential equations for o and f. 


Exercise 10.17 (Simplified CDS pricing) 

In this exercise we consider an approximation that reduces CDS pricing to the pricing of 
survival claims. Consider a CDS with deterministic loss-given-default 6 and premium payment 
dates tı < t2 <---<ty = T, and let to = 0. We slightly modify the standard definition of the 
payoff of the default payment leg and assume that if t; < rg < tj41 for any 0 € i € N — 1, the 
protection buyer receives the amount 6 at time £;,4 (and not directly at TR). 
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a) Denote by Q the risk-neutral measure and by (r;) the short rate of interest. Show that the 
price at (for simplicity) t = 0 of the modified default payment leg equals 


tip s 
3 E? (e Jo d (Iirr>ti1} = Tisi) (El) 


i=1 


Suppose that (r+) is a positive process. Is (E1) bigger or smaller than the value of the 
standard default payment where the default payment is made directly at TR? 


t 
b) Suppose that the short rate is deterministic and let po(t, T) = e^ Jo (945. Show that in 
this case, (E1) can be written as 


N-1 


5(po(0.t) — p(0, tv) Q(tr > tw) + X- Q(TR > ti)(po(0, ti+1) -in(0.1))). (E2) 
$1 


c) Use the approximation formula (E2) to compute numerically the fair spread of a CDS 
with maturity T' — 5 years and quarterly premium payments. Assume that under Q 
the default intensity follows a Cox-Ingersoll-Ross (CIR) process with dynamics dV, — 
k(0 — V,) dt + c /V, dW; with parameters Vo = 0.03, k = 1, 0 = 0.04 and o = 0.2, and 
assume r(t) = 0.01, 6 = 0.6. How does an increase in Vo, 0 or o affect the spread? 


Advanced 


Exercise 10.18 (Matrix exponential for continuous-time Markov chain) 

The matrix exponential of the Markov chain generator A € R(™+))*("+) for some n € N can 
be calculated using a number of software packages. One case where it can be calculated by 
simple matrix multiplication occurs when the generator is diagonalizable, meaning that there 
exists an invertible matrix A € RU D*(*J) such that A~!A.A = D where D = diag(do, ... , dn) 
is a diagonal matrix containing the eigenvalues of A. Show that, in this case, the matrix of 
transition probabilities P(t) for the interval [0, t] may be written as 


P(t) = Adiag(e^**, . . . , edt) A71. 


Exercise 10.19 (Calibrating hazard rate model when CDS spread curve is flat) 
Suppose that the positive constant 7° satisfies 


“Aten HAE = 550 (^. oreet 
q Ate "TI = OF 1 gy rogi (E1) 
0 
a) Show that 72 must also satisfy 


N M NAt 2 
ony. eT UTI At = 59° f e + tat forall NEN. 
k=1 b 
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b) Conclude that, when the CDS spread curve is flat, when interest rates are constant, and 
when premium payments follow a regular schedule, a constant risk-neutral hazard rate 72 
can be determined from a quoted market spread x* by solving (E1). 


Exercise 10.20 (Risk management for a corporate bond) 

Consider a single corporate zero-coupon bond with time to maturity T = 2 years and denote 
by 7 the default time of the bond. The recovery value of the bond is zero so that its payment 
at T is I {>T} and the current price of the bond is pọ. Assume that the risk-free interest rate 
is constant and equal to r > 0, that under the historical probability measure P, 7 is doubly 
stochastic with hazard rate process 4? and that under the risk-neutral measure Q, 7 is doubly 
stochastic with hazard rate process y2. Moreover, 


4T = vx and ye = 2v, 


where (y+) follows a CIR process with P-parameters K”, 0”, o > 0 and Q-parameters «9, 09, o > 
0. 


a) Develop a simulation algorithm to compute the loss distribution and the VaR of the bond 
over the time horizon T = 1 year. Define for this the function 


t2 
plti, t2, V; T, P, R, 0, c) = 3(exr(- n r+ ps ds) | Vt = v) ? 
1 
where (y) follows a CIR process with generic parameters &,0,c. Note that the function 
plti, to, Y; r, p, &,0,c) is known explicitly, see MFE (2015, Section 10.6.2), but its precise 
form is not required in this exercise. Explain, for which part of the simulation algorithm 
risk-neutral, respectively historical, quantities are needed. 


b) Modify the algorithm proposed in a) so that it can be used for pricing a put option on the 
bond with maturity T = 1 and exercise price K = po via Monte Carlo simulation. 


Exercise 10.21 (Credit value adjustment formula) 
The theoretical background for this exercise is discussed in MFE (2015, Section 17.2), but it 
can be solved with a basic understanding of counterparty risk. 

Consider two parties S and B who enter into a contract where the protection seller S provides 
protection against an adverse event and the protection buyer B pays premia in return. A case 
in point would be a credit default swap between protection seller S and protection buyer B or a 
reinsurance treaty between the primary insurer B and the reinsurer S. The market value of this 
contract at time t from the viewpoint of B is denoted V; and the maturity of the contract is T. 
To account for the possibility that S might default before T, a value adjustment is computed. 
Under the simplifying assumption that the default time of S and the counterparty-risk free 
price process (V;) are independent and this value adjustment is given by 

m 


CVAindep — 5f et EQ (V+) fa(t) dt. 
0 


Here ó? gives the loss given default of S; r > 0 is the risk-free interest rate, and fg denotes the 
density of the default time 7g of S. 
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a) Explain the formula for CVA@*?, 

b) Discuss the assumption that the market value (V;) of the contract and the default time Tg 
are independent for the special case of a reinsurance treaty between primary insurer B and 
reinsurer S. 

c) Evaluate the formula for CVA@deP in the case where V, is normally distributed under Q 
with mean 0 and variance c?t?. Assume that, under Q, 7s is exponentially distributed with 
parameter ^g. 

Hint. For a > 0, the indefinite integral of ze / Qo?) is given by —ae-* /Qo7). For y 0, 
the integral of te~” is given by —(1/^)e ^?! (t + 1/4). 
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Exercise 11.1 (Dependence in credit risk management) 

a) Give economic reasons why it is generally not appropriate to assume independence between 
the default of different obligors in a loan portfolio. 

b) How does dependence between defaults affect the loss distribution in a typical loan portfolio? 
Discuss implications for managing the risk of loan portfolios. 


Exercise 11.2 (Gaussian threshold models) 

Consider a Gaussian threshold model (X,d) where the critical variables X follow a factor 

model. 

a) Describe the mathematical structure of such a model and explain the advantages of using a 
factor model for X. 

b) Explain the difference between default correlation and asset correlation in this context . 

c) How is the threshold d; chosen in order to calibrate the model to a given default probability 
pi for obligor i? 


d) Derive the form of an equivalent Bernoulli mixture model. 


Exercise 11.3 (Tail dependence in threshold models) 
Consider a threshold model (X,d) and explain why lower tail dependence of the critical 
variables might have a substantial impact on the tail of the credit loss distribution. 


Exercise 11.4 (Exchangeable Bernoulli mixture models) 

Consider an exchangeable Bernoulli mixture model for m obligors and denote by the random 

variable Q the conditional default probability. 

a) Derive the relationship between the unconditional default probability m and the default 
correlation py and the first two moments of Q. Why is py always nonnegative in Bernoulli 
mixture models? 

b) Explain why for large m the tail of the distribution of the number of defaults is ‘up to first 
order’ determined by the tail of Q. 


Exercise 11.5 (Importance sampling in credit risk models) 


a) Describe the basic idea of importance sampling and the motivation for using the method in 
credit risk. 
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b) Assume that a Bernoulli mixture with mixing variable W is used to model the default 
state of a given credit portfolio. Why might importance sampling for the conditional 
loss distribution as described in MFE (2015, Algorithm 11.24) not suffice to obtain a low 
variance of the IS estimator, in particular for large portfolios. 


Basic 


Exercise 11.6 (Calculating moments of portfolio credit loss distributions) 

Consider a portfolio containing m — 1000 equally rated credit risks. Assume that for every 
obligor i € {1,...,m} the exposure is e; = 1M GBP and the default probability is p; = 1%. 
Calculate the expected value and standard deviation of the portfolio loss in the following 
situations. 


a) Loss-given-defaults (LGDs) are modeled as deterministic and 6; = 0.4, i € (1,...,m]. 
Defaults are assumed to occur independently. 


b) LGDs are modeled as deterministic as in a) but defaults are assumed to be dependent with 
pairwise default correlation (that is correlation between pairs of default indicators Y;) given 
by p = corr(Y;, Yj) = 0.005 for i Æ j. 

c) Defaults are dependent as in b) but LGDs are modeled as random variables A; satisfying 
Ai ~ Beta(a,b), i € {1,...,m}, where a = 9.2 and b = 13.8. Assume that LGDs 
are mutually independent across the portfolio and independent of the default indicator 
variables. 


Hint. E(AT) = Tip sedi, k EN. 


Exercise 11.7 (Correlation bounds for default indicators) 
Let Xj D B(1, pj), PIE (0, 1), Le {1,2}. 
a) Show that the correlation coefficient p of (X1, X2) satisfies Pmin < p < Pmax where 


M (p1, p2) — pipe 
Vpi(l — pi)p2(1 — p2)” 


RE W (p1, p2) = Pipa nd mores 
min ~~ max =; 
Vpi(1 — pi)p2(1 — p2) 


for the countermonotonicity copula W and the comonotonicity copula M. 
Hint. Use Hóffding's Theorem (MFE (2015, Theorem 7.28)). 


b) Consider the case p; = p2 =: p. For which value of p is the range of p maximal and what is 
this range? 


Exercise 11.8 (Asset correlations in Gaussian threshold models with two-group structure) 
Consider a Gaussian threshold model (X, d) in which the critical variables follow the factor 
model 


X; = B; E; + 1-— Bigs, iE {1,... m}, 
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where E, €1,...,€q are independent standard normal variables and 0 < 6; < 1 for all i. The 
systematic variables F; in turn satisfy 


di Fi, Ll een 
"C (pF + J1— gn i—-ncl...,m 


where PFi,F5 are independent standard normal factors, 0 < p < 1 and 1 « n « m — 1. 
Obviously, this model has a two-group structure determining the dependence between defaults. 


Derive expressions for the within-group asset correlations (for each of the two groups) and the 
between-group asset correlation. 


Exercise 11.9 (Joint default probabilities in Gaussian threshold models) 
Consider a Gaussian threshold model (X, d) in which the critical variables follow the one-factor 
model 


Xi = VBUF + 1- Bie, UC vm 
Here F,€1,...,€m are independent standard normal variables and 0 < 0; < 1 for all i. The 
goal of this exercise is to describe the equivalent Bernoulli mixture model. 
a) For 1 € k < m, express the joint probability that the group of obligors {71,...,i,} C 
(1,..., m) all default as a one-dimensional integral over the distribution of the factor F. 


b) Show that in the exchangeable version of this model (that is for d; = d and B; = p = 
corr( X j, Xj) for all i, j, 0) the formula in a) leads to 


«f (Pis) e 


that is the kth order joint default probability 7; is related to the default probability m of 
each obligor according to this formula. 


Exercise 11.10 (Joint default probabilities in threshold models with Gumbel copula) 
Derive a formula that relates higher-order default probabilities 7; and individual default 


probabilities 7 in an exchangeable default model of threshold type based on the Gumbel copula. 


Exercise 11.11 (Exchangeable one-factor Bernoulli mixture models) 

Suppose you have a portfolio of m similarly rated obligors. You decide to model the default state 
of this portfolio by an exchangeable Bernoulli mixture model with mixing variable Q. Recall 
that in this model, the default indicators Y; € (0,1) satisfy P(Y; = y; |Q = q) = q5(1— q)! ^vi, 
yj € {0,1}, j € {1,..., m), so that 


qa - gi = qat = gm han 


mall 


P(Y =y|Q= q)= 
1 


j 


a) Show that the kth order joint default probability 7, = P(Yj = 1,...,Y;, — 1l, 1 < 
ji <- < jk < m) of any subgroup of k € {1,...,m} components defaulting satisfies 


Tk = EO"), k € {1,..., M}. 
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b) Compute the implied default correlation py = corr(Y;,, Yj.) for j1,j2 € {1,...,m}, ji Æ ja. 
in terms of mı and 79. 


Exercise 11.12 (Exchangeable beta mixture model) 
Suppose the credit risk of a portfolio of m similar obligors is modelled with an exchangeable 
Bernoulli mixture model with mixing variable 


Q ~ Beta(a, b), 


that is the mixing variable Q is distributed according to a beta distribution with parameters a > 

0, b > 0 and density fo(z) = z*-1(1—2)^71/f(a, b), x € (0,1), where £(a, b) = l'(a)F (b)/T (a+b) 

denotes the beta function. 

a) Let k € {1,...,m}. Compute the kth order joint default probability a, = P(Y; = 
15:415, = 1) for all lagi <+ < jk < mM. 
Hint. T(z +1) = 2r (z), z > 0. 

b) Determine a,b such that the individual default probability mı is 1% and the default 
correlation py is 0.5%. 

c) Let M = De Y; denote the number of defaults in the portfolio. Show that its probability 
mass function (pi). is given by 


py = P(M = k) = 


(t) B(a -- k,b -- m — k) 
k B(a, b) í 


d) Compute the probability mass function p, = P(M = k), k € {0,...,m}, under the 
assumption that the default indicators Y;'s are independent. 


e) Suppose m = 1000. For k € {0,...,60}, plot p; and fy. Also plot pk/Pk and comment on 
the plot. 


Exercise 11.13 (Exchangeable probit-normal mixture model) 

Suppose the credit risk of a portfolio of m similar obligors is modelled with an exchangeable 
Bernoulli mixture model where the mixing variable Q has a probit-normal distribution with 
parameters u € R and o > 0, that is 


Q-—6(u-c-oZ2) Z~N(0,1). 


Therefore, Q = ®(X) for X ~ N(u,o?) and thus, by Exercise 11.11 a), 7, = E(®(X)*), 
k € (1,..., m]. 


a) Derive the distribution function and density of Q. 


b) Use Exercise 11.9 to infer that the exchangeable mixture model is equivalent to an exchange- 
able Gaussian threshold model with default probability a and asset correlation p which you 
should determine as functions of u and o. 


c) Use these results to give the mean E(Q) of the probit-normal distribution. 
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Exercise 11.14 (Exchangeable Bernoulli mixtures with extremal mixing) 
Consider an exchangeable Bernoulli mixture model for m obligors with given default probability 
P(Y; = 1) =7 € (0,1) but unspecified distribution function G(q) of the mixing variable Q. 


a) Determine the mixing distributions G™™(q) and G™*(q) such that the default correlation 
min 


py attains the bounds py" = 0 respectively pp^* = 1. 
b) Show that for the mixing distribution G™**(q) the default indicators are comonotone. 


c) Let L = 72:4 Y; (exposures are ignored for simplicity). Show that for the mixing distribution 
G™*(q) the expected shortfall ES,(L) is maximized over all mixing distributions G(q) with 
le qdG(q) = « (that is all mixing distributions that are consistent with the constraint 
P(Y; = 1) 2 7). Does G™**(q) also maximize Value at Risk (proof or counterexample)? 


Exercise 11.15 (Two-factor CreditRisk* model) 

Consider a portfolio of m = 1000 obligors with an exposure of e; = 1M for each obligor 
i € {1,...,m}. Suppose we model dependent defaults in the portfolio using a two-factor 
CreditRiskt-style of model as in MFE (2015, Section 11.2.5). Assume that the default count 
variables Y;, see MFE (2015, Section 11.2.5), satisfy 


- 7 _, [ Poi(0.019), i=1,...,500, 
Wh Ta) = (a, Pa) EA +0.5y2)), i=501,..., 1000. 


ind. 


Also assume that loss-given-default is 100% in all cases and that V1, V5 ^" Ga(1/2,1/2); note 


that the second parameter 1/2 is the rate parameter of the gamma distribution. Compute the 
mean and variance of the portfolio loss. 


Exercise 11.16 (Large portfolio asymptotics application) 

Over the years a retail banking division specialising in small commercial loans has had a 
consistent lending policy: 50% of its loans have been for the amount of 5M and 50% of its loans 
have been for the amount of 1M. Moreover, 50% of both the larger and smaller loans have been 
rated as ‘risky’ and have been assigned a default probability of 1% per annum, whereas the 
other 50% have been rated as ‘safe’ and have been assigned a default probability of 0.1% per 
annum. 

The bank uses a one-factor Gaussian threshold model for its portfolio and carries out a fully 
internal calculation for economic capital purposes. In the one-factor model the risky loans are 
assumed to be 80% systematic (that is 80% of the variance of the driving ‘asset value’ variable 
is assumed to be explained by systematic factors) whereas the safe loans are assumed to be 
only 20% systematic. A deterministic loss-given-default of 0.6 is assumed. 

The portfolio consists of m = 10000 individual loans and the bank decides to use a large 
portfolio argument to compute the 99.9% value-at-risk. 


a) Derive the form of the asymptotic relative loss function /(»), see MFE (2015, Section 11.3.2), 
under the assumption that the portfolio is grown ad infinitum with the same lending policy. 


b) Approximate the 99.9% VaR for the portfolio. 
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Exercise 11.17 (Large portfolio asymptotics with stochastic LGD) 
Consider a Gaussian threshold model (X,d) with the critical variables following the one-factor 
model 


X; = JB F + V1 = Bisi, Ue {1,... mM}, 


ind. 


where F,€1,...,€m ~ N(0,1) and 0 < 6; < 1 for all i. For i = 1,...,m, let Y; = Iyx,<a3 
denote the default indicator variables and p; = P(Y; = 1) the default probabilities. 
a) Show that the model is equivalent to a one-factor Bernoulli mixture model for the default 


indicators where the common factor is V = —F and the conditional default probabilities 
take the form 


pil) = P(Y; = 1|V — v) = Oui + aiv). 


Also derive expressions for u; and oj. 


b) Suppose there are m = 10000 obligors in the portfolio, half of them have exposure 2M GBP, 
default probability p; = 0.01 and factor loading b; = 0.6 and the other half of them have 
exposure 4M GBP, default probability p; = 0.05 and factor loading b; = 0.8. Assume the 
loss-given-default (LGD) is 0.6 for all obligors. Use a large portfolio argument to compute 
an approximation for the 99% value-at-risk of the portfolio loss. 


c) Now suppose that a stochastic LGD A; depending on the economic factor V is introduced 
into the model for every obligor i. It is assumed that 


i) LGDs are conditionally independent given WV; 
ii) they are independent of the default indicators given V; and 
iii) the expected LGD given W satisfies E(A; | U = v) = (0.5 + v). 


Recompute the approximate 99% value-at-risk to incorporate the stochastic LGDs. 


Exercise 11.18 (Importance sampling with exponential tilting for gamma) 
Let X ~ Ga(o,8) be a gamma distributed random variable with corresponding density 
fre ae exp(—fx), x > 0, and moment-generating function Mx (t). 


a) Compute the exponentially tilted density g(x) = exp(tx) fx (xz) /Mx(t), x > 0. What are 
the constraints on the value of t? 


b) Can the mean of X be shifted to arbitrary positive values under the importance sampling 
density? 
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Exercise 11.19 (Importance sampling with exponential tilting for Poisson) 


a) Let X ~ Poi(j) under the probability measure P. Determine the distribution of X under 
the probability measure Q obtained by exponential tilting; in other words, what is the 
distribution of X after exponential tilting? 


b) Let X; EV Poi(u;), à = 1,...,m, under the probability measure P. We are interested in the 
distribution of the random variable L = 77-4 e; X; where e1, ... , €m are known deterministic 


exposures. Suppose we change the probability measure to Q; by exponentially tilting the 
random variable L. What is the joint distribution of (X1,..., Xm) under Q;? 


c) Suppose we want to estimate the exceedance probability P(L > c) of L over some threshold 
c > E(L) with a Monte Carlo simulation. Since c >> IE(L), we would like to use as variance 
reduction technique importance sampling based on a proposal distribution Q; under which 
L has mean c. Suppose this proposal distribution is obtained by exponentially tilting the 
distribution of L. What equation should we solve to determine the exponential tilting 
parameter t? 


Advanced 


Exercise 11.20 (Importance sampling for CreditRisk* ) 
How are Exercises 11.18 and 11.19 relevant to the problem of using importance sampling to 
estimate tail probabilities of the loss distribution in CreditRiskT? 


Hint. You may assume the one-factor case. 


Exercise 11.21 (Probability mass function of number of defaults under exchangeability) 
Let Yi,..., Ym ~ B(1, p) be exchangeable default indicators of a portfolio of size m and let 
M = 33. Y; be the number of defaults in this portfolio. For k € {1,...,m}, derive an 
expression for P(M = k) (that is for the probability that exactly k obligors default) in terms of 
the higher-order default probabilities Tk, ..., Tm. 


Note. Recall that my is the probability that an arbitrarily selected subgroup of at least k obligors 
default. 


Exercise 11.22 (Threshold model with Archimedean survival copula) 

Let (X,d) be an m-dimensional threshold model where X has an Archimedean survival copula 

with generator given by the Laplace transform G of some distribution function G on [0, oc) 

with G(0) = 0. 

a) Show that this model is equivalent to a one-factor Bernoulli mixture model with a factor V 
whose distribution function has Laplace transform G and show that the conditional default 
probabilities are given by 


pi() 21-exp(-UG Q0 — p), i€(L..., m) 


where p; is the unconditional default probability. 
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b) Assume now that V ~ Ga(a, o), a > 0, so that G is the Laplace transform of a gamma 
distribution. In this case, X has a so-called Clayton survival copula. Show that 


pi() 21—exp(—vo((1— p) V^ —1), i {1,...,m}. 


c) Let k; = a((1 — pj) /* — 1), i € (1,..., m}. Show that kj z pj for small pj. 

d) Explain why this threshold model is equivalent to the Bernoulli mixture model implied by 
a one-factor version of CreditRisk* with heterogeneous default rates. 

e) Compute the joint default probability of obligors i Z j with identical unconditional default 
probabilities p :— pi = pj € (0,1). Work out how this probability behaves as a — 0+ and 
a — co. Which direction increases the amount of dependence in the model? 


Exercise 11.23 (Copula of exchangeable CreditRisk* model) 

Suppose that Y — (Yi, - Ym) follows an exchangeable one-factor CreditRisk" model in 
which the Y; are conditionally independent and Poisson-distributed with parameter cV where 
V ~ Ga(o, a) and c > 0, a > 0. Let Y; = Ir >o} be the default indicator for firm i. Show that 
for any 1 € k € m and distinct i4,..., i; € (1,..., m], 


where Col is the survival copula of a k-dimensional Clayton copula Cf!(u1,..., uz) = (a uz’-— 


k+1)~'/° with parameter 0 that you should determine. 
Hint. Show that 7, = P(Yi, 20,...,Y;, = 0) = C$ (8,..., ft) for all k € (1,..., m] and use 
the fact that this implies that mg = Cf (m,...,) for all k € (1,...,m]. 
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Review 


Solution 1.1 (Types of financial risk) 


a) Market risk is the risk of a change or loss in the value of a financial position due to changes 
in the underlying components. 


Examples: 
A loss of value of a European call option due to a loss in value of the underlying stock. 


A drop of a bond price (and thus a loss if the bond is sold) as interest rates rise. This is 
due to the fact that investors (bond buyers; money lenders) are not willing to pay the 
face value to own a bond which pays less interest than other bonds in the market. 


Assume that on 1 January 2015, USD/CAD = 1.18 (so 1USD = 1.18CAD) and you 
decide to purchase Canadian dollar for 1000 USD, so you owned 1180 CAD. On 1 January 
2016, you would like to change the money back to US dollars but USD/CAD = 1.38 (so 
1CAD = 1/1.38USD) and thus you only receive 1180/1.38 = 855.07 USD. 


b) Credit risk is the risk of a counterparty failing to meet its contractual obligations. 


Examples: 
A consumer failing to make a payment due on a credit card or other loan. 
An insolvent bank not returning funds to a depositor. 


An insolvent insurance company not paying a policy obligation. 


c) Operational risk is the risk of a loss resulting from inadequate or failed internal processes, 
people and systems or from external events. This definition includes legal risk, but excludes 
strategic and reputational risk. 


Examples: 

(Internal or external) fraud. 

Fat-finger trades. 

Earthquakes (for example Kobe earthquake in the case of Barings Bank). 
High-frequency or automatic trading (for example the Flash Crash of 6 May 2010). 
Terrorism attacks (for example, the 11 September 2001 attacks). 


Crashed or hacked computer systems. 


d) Model risk is the risk of using a inadequate or wrong model for modelling risk. 


Examples: 


If losses come from a heavier-tailed (Student) ¢ distribution, using a normal loss distribu- 
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tion is inadequate and may lead to an underestimation of risk capital. 


= The 1998 debacle of the hedge fund Long Term Capital Management (LTCM) (whose 
board included the 1997 Nobel Prize winners in Economics Myron S. Scholes and Robert 
C. Merton) is attributed to model risk through the heavy use of high-risk convergence 
trades; see also Brown et al. (2015) for other reasons. 


= The pricing of collateralized debt obligations (CDOs) has been affected by model risk as 
models were used which cannot capture the notion of tail dependence; see, for example, 
Salmon (2012). 


e) Systemic risk is the risk of a collapse of an entire financial system due to the propagation 
of financial stress through a network of participants. 
Examples: 


= The bankruptcy of Lehman Brothers in 2008 resulted in a domino effect which generated 
major risk(s) to the global financial system due to the size and integration of the company 
in the financial system and economy. 


= Simultaneously, American International Group (AIG) suffered serious financial problems. 
However, in contrast to Lehman Brothers, AIG was bailed out by the U.S. government 
for 180B USD as it was considered ‘too big to fail’. Analysts and regulators believe that 
a bankruptcy of AIG would have caused numerous other financial firms to collapse, too. 


Solution 1.2 (Financial crisis of 2007-9) 

An outline of the crisis is given in MFE (2015, Section 1.2.1) but there is a huge amount of 
literature to consult, both academic and popular. The 2010 book ‘The Big Short’ by Michael 
Lewis, which was made into a 2015 film of the same name, gives much insight into the sub-prime 
lending scandal in the US and the role of securitization in the crisis. For UK readers ‘Shredded: 
Inside RBS, the Bank That Broke Britain’ by Ian Fraser is an entertaining account of the fall 
from grace of one of the UK’s major banks. 


Solution 1.3 (Bank run) 

A bank run (also known as a run on the bank) occurs when a large number of people withdraw 
their money from a bank, because they believe the bank may cease to function in the near 
future. In other words, it is when, in a fractional-reserve banking system (where banks normally 
only keep a small proportion of their assets as cash), a large number of customers withdraw 
cash from deposit accounts with a financial institution at the same time (because they believe 
that the financial institution is, or might become, insolvent) and keep the cash or transfer it 
into other assets, such as government bonds or precious metals. 

An example from recent history is Northern Rock. On Friday 14 September 2007 this was the 
first British bank in 150 years of history to experience a bank run which eventually resulted in 
its failing. Numerous articles have been written on the topic, start for instance with ‘Northern 
Rock: Lessons of the fall. How a financial darling fell from grace, and why regulators didn’t 
catch it! in The Economist in 2007. 
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Solution 1.4 (Regulatory frameworks) 


a) 


Both frameworks share a three-pillar concept with similar meaning of the three pillars. We 
focus on Basel II here, the second of the Basel Accords (a set of international banking 
regulations developed by the Bank for International Settlements to promote the stability 
of the international financial system), originally published in 2004 and implemented by 
2008. Basel II aims at controlling how much capital banks are required to hold to guard 
against future losses. To this end, it introduced three parts (the pillars). Pillar 1 (minimal 
capital requirements) concerns the quantification of regulatory capital to ensure solvency 
of the financial firm under consideration; most of the modelling done in quantitative risk 
management concerns Pillar 1. Pillar 2 (supervisory review) imposes regulatory oversight of 
the modelling process (including risks not considered in Pillar 1) and encourages financial 
firms to apply sound techniques for modelling, monitoring and managing risks. Pillar 3 
(market discipline) contains disclosure requirements with the aim to improve public disclosure 
of risk measures and other risk management related information. 


Common features of Basel II and Solvency II are: 


= The goal of ensuring sufficient funding to account for future risks (although the goal of 
the Basel Accords is rather the stability of the international financial system whereas 
the Solvency framework focuses more on protecting individual policyholders). 


= The use of value-at-risk (VaR) as risk measure. 
= Both frameworks have an impact on decision-making in risk management. 


# Both frameworks have a three-pillar concept (Pillar 2 and 3 are rather similar; the content 
of Pillar 1 is quite different in the two frameworks). 


Key differences between the two frameworks are (all under Pillar 1): 
= Scope: 
Basel II: The Basel Accords apply to the banking industry worldwide. 
Solvency II: The Solvency directives apply to the European Union. 
= Implementation year: 
Basel II: 2008 
Solvency II: 2016 
= Risk classes: 
Basel II: Mainly market, credit and operational risk. 
Solvency II: Comprehensive approach including all major risks (higher complexity). 
= Risk measure: 


Basel II: The VaR confidence levels and time horizons depend on the risk class (99% 
for market risk over two weeks; 99.9% for credit and operational risk over one year). 


Solvency II: The VaR confidence level is 99.5% over one year. 
= Perspective: 


Basel II: Retrospective; depending on the risk class, the risk calculation takes place 
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more frequently (for example for market risk). 
Solvency II: Prospective; risk calculation once a year. 
= Level of sophistication: 


Basel II: Three levels (depends on risk class; for example for operational risk: basic 
indicator approach, standard approach and advanced measurement approach). 


Solvency II: Five levels (from simplified standard formula to full internal model). 
= Valuation: 

Basel II: Market-based (market risk) and accounting based (credit risk). 

Solvency II: Balance sheet based. 


Solution 1.5 (Procyclicality) 

In general terms, procyclicality describes positive correlation between an economic quantity 
with the overall state of the economy. In the context of risk management and capital adequacy, 
this means that regulatory capital requirements may rise in times of stress and fall in times of 
expansion; see MFE (2015, Section 1.3.3). 


Solution 1.6 (Basel III) 

Basel III is a set of banking regulations developed by the Bank for International Settlements. 
It was agreed upon by the Basel Committee on Banking Supervision in 2010, was scheduled 
to be introduced from 2013 until 2015, but the implementation was extended to 31 March 
2019. Basel III overall aims at reducing the ability of banks to damage the economy by taking 
on excess risk. More precisely, it aims at strengthening the requirements from Basel II on a 
bank's minimum capital ratios (for example, the minimum amount of equity as a percentage of 
assets increases from 2% to 4.5% with another buffer of 2.5%, so in total 7%) and introduces 
requirements on liquid asset holdings and funding stability. As such, Basel III builds on and 
extends Basel II as opposed to replacing it. See the discussion in MFE (2015, p. 18). 


Solution 1.7 (Capital adequacy, leverage and liquidity coverage ratios) 


= Capital adequacy ratios (a minimum of 8% under Basel III) measure a bank's available 
capital as a percentage of its risk-weighted credit exposures. It measures the extent to which 
a bank has sufficient capital relative to the risk of its business activities; see Ingves (2014). 
It promotes financial stability and efficiency in economic systems throughout the world. 

" Leverage ratios measure the extent to which a bank has financed its assets with equity; see 
Ingves (2014). It does not matter what those assets are, or what their risk characteristics. 
Leverage ratios effectively place a cap on borrowings as a multiple of a bank's equity. 

= Liquidity ratios refer to the proportion of highly liquid assets held by financial institutions, 
to ensure their ongoing ability to meet short-term obligations and to encourage them to 
hold portfolios of highly liquid, lower-risk assets. 
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Solution 1.8 (Major failures) 
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Case Studies 


On 26 February 1995, Barings Bank, the United Kingdom’s oldest merchant bank, was 
declared insolvent after suffering losses of 827M GBP due to unauthorized trading by its head 
derivatives trader Nick Leeson working in the bank’s office in Singapore. The supervisors of 
Nick Leeson approved an arbitrage strategy whereby Nick Leeson was supposed to make 
small profits from buying/selling Nikkei 225 futures contracts listed on the Osaka Securities 
Exchange (OSE) in Japan and selling/buying them on the Singapore International Monetary 
Exchange (SIMEX). However, Nick Leeson began to exploit the fact that the SIMEX is a 
relatively small market which he could influence. For example if a large order to purchase 
futures on the SIMEX would come to Barings, he would immediately purchase the equivalent 
amount on the OSE which drove up prices on the SIMEX. He then sold the futures purchased 
on the OSE to his client in Singapore for the higher price making an arbitrage profit. He 
thus benefited from Barings’ customer trade by using insider information. This was the 
beginning of his speculations. It ended with the unexpected Kobe earthquake on 17 January 
1995 which caused a significant drop in the Nikkei 225 and created a huge loss due to a 
short straddle position in the Nikkei 225 which would have only been beneficial if the Nikkei 
225 had not moved significantly overnight. In large part, Nick Leeson was able to build 
these speculative gambles by writing options and thereby collecting option premia. He was 
eventually sentenced to six and a half years in prison in Singapore, but was released early 
in 1999 after being diagnosed with colon cancer. He has since recovered and appears as a 
keynote speaker at conferences on fraud, operational risk and computer security. Barings 
senior management barely accepted any responsibility, escaped criminal justice, never paid 
fines nor served a day in jail. Barings shareholders and bond holders lost their investment 
entirely. 

Lessons to take away are: Enforce division of labor and separation of power between 
front and back offices; follow the cash flow trail; understand your business; prepare clear 
assumptions about the unit profit earned (Leeson’s unit reported 500% of his projected 
profit in 1994 — supposedly from arbitraging Nikkei 225 futures between SIMEX and OSE; 
this should have been questioned); establish and enforce trading position limits; monitor 
open interests; and segregate agency trading (on behalf of a bank’s client) from proprietary 
trading (trading for the bank itself) to avoid a conflict of interest. 

The reason why Barings Bank is such an important case is that it checks all boxes in the 
definition of operational risk. There were certainly “failed internal processes” at Barings 
Bank. Nick Leeson was both floor manager and the unit’s head of settlement operations for 
Barings’ trading on the SIMEX. He could thus settle his own trades which circumvented 
internal control. He operated with no supervision from London, which made it easier 
to hide his losses, and there was a failure of “people” (including himself, of course) and 
risk-management “systems” as well as computer “systems” (with which he falsified trading 
records) at Barings Bank. Kobe earthquake was the “external event”. 


Note. See also Jacque (2015, Chapter 10). 


Metallgesellschaft (MG) incurred a loss of 1.3B USD through its U.S. oil subsidiary Metallge- 
sellschaft Refining and Marketing because of hedging a short position in long-term forward 
contracts (oil commitments) with a long position in short-term future contracts. Problems 
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with this (rather long-term) strategy appeared due to maturity mismatch, liquidity risk, 
operational risk and falling oil prices in 1993, for example. The gain due to the short 
positions was more than offset by a loss due to the long positions, which required a 1.9B USD 
bailout from MG’s bankers. 


Note. See also Jacque (2015, Chapter 6). 


On 13 June 1996, Sumitomo announced an accumulated loss of 2.6B USD due to copper 
trading activities by its trader Hamanaka Yasuo over several years. Due to its size (rumored 
to be twice as large as the next copper trader), Sumitomo and especially Hamanaka 
Yasuo were manipulating the copper market in two ways, accumulating large amounts of 
copper to building up futures and forward positions (this is known as market *squeeze") 
or to additionally create a shortage of copper (this is known as “cornering” the market). 
Hamanaka Yasuo was sentenced to eight years in prison in 1998 due to four counts of forgery 
and fraud and Sumitomo paid fines of 150M USD to the United States’ Commodities Futures 
Trading Commission and 8M USD to the United Kingdom’s Securities and Investments 
Board to settle charges of copper price manipulations. 


Note. See also Jacque (2015, Chapter 7). 


On 24 January 2008, Société Générale reported a fraud of 7.2B USD due to trader Jeróme 
Kerviel building huge positions in stock index futures hidden through fake offsetting trades. 
Due to lack of evidence, Jeróme Kerviel was charged with breach of trust, forgery and 
illegally accessing computers. He claimed that his actions were known to his supervisors 
and common, and that the losses were caused by panic selling by the bank. Skeptics also 
questioned how unauthorized trading of this magnitude could go unnoticed until 19 January 
2008 when unauthorized trading was uncovered and the bank closed these positions over 
three days starting on 21 January 2008; this was also the day when European stock markets 
suffered heavy losses of about 6% due to Société Générale closing out positions built up by 
Jeróme Kerviel. In a sequence of court cases, he was originally sentenced to five years of 
prison and to repay the loss but later on the latter was dropped and he overall served less 
than five months of detention. On 7 June 2016, a court condemned Société Générale for 
unlawful firing of Jeróme Kerviel. 

Some of the failures involved are failure to control, to follow the cash flow trail, to 
establish and enforce trading position limits, to monitor who are the key holders of open 
interests in different products and failure to break routine (by rotating employees). 


Note. See also Jacque (2015, Chapter 11). 


On 6 December 1994, Orange County (a county in the Los Angeles metropolitan area) 
declared bankruptcy due to a loss of 1.65B USD in a public investment pool which was 
supposed to be a mutual fund (over 7.5B USD) but turned into a hedge fund. The investment 
strategy (earn high incomes for the county without raising taxes through risky, leveraged 
positions in bonds) used by the county’s Treasurer-Tax Collector Robert Citron worked 
well under low or declining interest rates as long as money could be borrowed short term 
at lower interest rates and re-invested longer term at higher interest rates. The county’s 
finances were not suspect until February 1994 when the Federal Reserve began to raise 
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interest rates at which point many securities in Orange County’s investment pools began to 
fall in value. As a consequence, creditors for the county requested extra margin payments 
from the county at which point the maturity mismatch led to Orange County’s bankruptcy. 
Robert Citron pleaded guilty to six felony counts and three special enhancements. Charges 
also included filing a false and misleading financial summary to participants purchasing 
securities in the investment pool. He was sentenced to one year of work release and five 
years of supervised probation, and performed 1000 hours of community service. Part of 
the problem was that Robert Citron’s achievements (such as earning an excess 500M USD 
over the State of California’s investment performance between 1991 and 1993) earned him 
national fame and recognition which inflated his ego to the point that Citron could not 
tolerate criticism, for example Goldman Sachs (in 1993) and John Moorlach (challenger 
of Robert Citron in the June 1994 elections) criticized his investment decisions not taking 
into account alternative interest rate scenarios. The criticism was echoed widely by the 
business press which called it “a scandal waiting to happen”; nevertheless, Robert Citron 
was re-elected by a wide margin. Failures include the questioning of the roots of Robert 
Citron’s success (especially by himself), the communication of the leveraged nature of his 
investments, the limited accountability the Board of Supervisors subjected him to (which 
also failed at overseeing the flaws of the investment strategy), the missing red flag to 
investors given the investment pool’s high performance and the failure of state and federal 
regulation for not targeting municipal investment pools. 


Note. See also Jacque (2015, Chapter 14). 


LTCM was a hedge fund management firm which used absolute-return trading strategies 
combined with high leverage. The firm’s master hedge fund, Long-Term Capital Portfolio 
L.P. showed an impressive growth since its existence in 1994 (annualized returns after fees 
after the first three years: 21%, 43%, 41%), but collapsed in 1998 (because of a loss of 
4.6B USD due to the 1997 Asian and 1998 Russian financial crises), which stoked fear of 
systemic risk so that, on 23 September 1998, the Federal Reserve Bank of New York coerced 
a consortium of 16 financial firms (LTCM’s main creditors) to a bailout in the amount of 
3.6B USD; the fund was liquidated and dissolved in early 2000 and the consortium was 
paid back. Members of LI'CM's board of directors included Myron S. Scholes and Robert 
C. Merton, who shared the 1997 Nobel Memorial Prize in Economic Sciences for a “new 
method to determine the value of derivatives”. After the hedge fund’s failure, the theories 
of Merton and Scholes took a public beating. In its annual reports, Merrill Lynch observed 
that mathematical risk models “may provide a greater sense of security than warranted; 
therefore, reliance on these models should be limited”. 

There are several lessons to take away: For investors, the disclosing of minimal information 
(of proprietary trading strategies of hedge funds in general and the especially vague letters 
to LTCM's investors in particular); for hedge fund risk managers, that history does not 
necessarily repeat itself (the assumption that it does, thereby even relying on relatively 
recent history, was part of LTCM’s convergence trades); for hedge fund strategists, the 
fact that the same central theme, convergence of spreads, applied to different capital 
markets does not necessarily imply true diversification, and that leverage paired with 


99 


1 Solutions to Risk in Perspective 


100 


investing in illiquid assets is deadly; for lenders to hedge funds, do not blindly lend without 
access to vital financial information (such as leverage or off balance sheet over-the-counter 
(OTC) derivatives); for regulators, the fact that LTCM's loss of 4.6B USD was much more 
devastating for financial markets than Amaranth’s loss (see h)) due to the fact that the 
former used OTC derivatives (in comparison to Amaranth’s quasi-exclusive dealing with 
exchange traded derivatives) which allowed LTCM to amass more than a notional trillion 
dollars of derivatives contracts with a capital base of ‘only’ 4.5B USD. Note that the AIG 
debacle (see g)) is largely a replay (on a larger scale) of the LTCM debacle due to the OTC 
contracts involved, which were credit-default swaps in this case. 


Note. See also Jacque (2015, Chapter 15). 


AIG was a central player in the financial crisis of 2007-9 through its division AIG Financial 
Products (AIGFP) which sold massive amounts of credit default swaps (CDSs) *without 
putting up initial collateral, setting aside capital reserves, or hedging its exposure — a 
profound failure in corporate governance, particularly its risk-management practices"; see 
Angelides (2011, p. 352). This essentially bankrupted the entire AIG operation and forced 
the U.S. government to bail out the insurer for 182.3B USD as it was considered 'too big 
to fail. At the core of the debacle was that AIGFP largely underestimated the number of 
CDSs they needed to pay out, which led to massive losses as the financial crisis worsened 
during early 2008 and many companies began to default on their debt. 

In 1998, JP Morgan approached AIGFP with a proposal to write insurance against default 
on securitized products known as collateralized debt obligations (CDOs). Originally, the 
CDOs created by JP Morgan and other financial institutions were written strictly against 
corporate debt, but since 2003, due to the real estate boom, CDOs contained more and 
more mortgage debt secured by real estate (private label mortgage-backed securities). What 
AIGFP failed to realize was the emerging trend of subprime lending, AIG was insuring 
CDOs whose share of subprime mortgages was increasing (by late 2005 to 80B USD, about 
2096 of AIGs total CDS portfolio), yet they enjoy AIG's AAA rating. AIGFP never properly 
reserved for the credit default swaps that it was writing (CDS valuation models showed only 
a minor risk of default and CDS are not traditional insurance products subjected to the same 
stringent capitalization requirements and regulations as the rest of the insurance industry; 
AIGFP was thus over-insuring and under-reserving). In March 2005, AIG was downgraded 
to an AA rating which triggered an increase in collateral posting to counterparties of the 
CDS AIG had written. This led to further downgrades of AIG. When default rates on 
subprime mortgage-backed securities increased in 2008, disaster struck, AIG faced liquidity 
problems which essentially bankrupted the whole firm by September 2008. 

Lessons learned include the fact that there is no free lunch (making profits from CDSs 
without ever paying on losses), that in contrast to, say, life insurance contracts (where 
mortality tables are comparably stationary over time and deaths are comparably independent 
of each other), modelling the (credit) risk involved in CDSs is different since the probabilities 
of default on larger business loans are much more difficult to determine accurately and such 
defaults are highly correlated. 


Note. See also Jacque (2015, Chapter 16). 


Case Studies 


h) Amaranth Advisors was a hedge fund that collapsed in 2006 after losing over 5B USD as a 
consequence of a trading stategy involving natural gas futures. Speculation in gas futures 
works in the following way. We buy gas futures if we expect the gas price to rise, and sell 
gas futures if we expect the gas price to fall. For example suppose it is now June and we 
buy a future contract for December for 1000 USD. In October we can possibly sell such 
a contract for 1500 USD if news suggests gas prices will rise. If on the other hand we 
sell a future contract for 1000 USD we can possibly buy the same contract for 800 USD 
in October if news suggests prices will fall. Use of leverage, combined with the fact that 
futures exchanges only require the margin to be posted, may lead to very high returns if we 
call the direction correctly. 

Amaranth became heavily involved in energy trading from 2004 to 2005. In the aftermath 
of Hurricane Katrina star trader Brian Hunter made large profits in 2005 by speculating 
that the effect on gas production would raise prices. Emboldened by this success the firm 
put on similar highly leveraged trades in 2006 which made catastrophic losses when prices 
failed to rise as predicted. 

Risks that played a role included: 

m Market risk: A fluctuating price of natural gas clearly was the main risk. 

" Economic risk: This could be said to have affected the gas price. 

=" Operational risk: Very significant. A single trader, Brian Hunter, was given freedom to 
adopt almost unlimited positions with no proper supervision. 

= Agency risk and moral hazard: Hunter had personal incentives to up the level of risk 
due to his remuneration deal. 

» Liquidity risk (price): Again, very significant, because Hunter had assumed enormous 
positions that were very difficult to unwind quickly. 

= Liquidity risk (funding): This risk, which is related to the previous one, kicked in later 
when the hedge fund was unable to make margin calls at the futures exchange. 

=" Concentration risk: Very significant. The fund had an extremely large exposure to 
market risk in the gas futures market. 

A good answer might discuss sensible strategies designed to increase diversification and 

hedge some of the main risks. Amaranth was not a true multi-strategy fund and should 

never have committed so heavily to directional bets in a single market. A good answer 

would also discuss measures that would have reined in Brian Hunter, added more oversight, 

imposed trading limits and reformed the remuneration package that provided the wrong 

kind of incentives. 

Note. See also Jacque (2015, Chapter 5). 


Solution 1.9 (Market crashes) Ex. p. 10 


a) The Wall Street Crash of 1929 was a major stock market crash from 24 October to 29 
October 1929. The collapse of share prices on the New York Stock Exchange was the biggest 
one in the U.S. history and the beginning of the 12-year Great Depression. This was in 
parts attributed to the (mis)belief that the stock market would rise forever; a quote from 
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economist Irving Fisher at that time speaks for itself: “Stock prices have reached what 
looks like a permanently high plateau.” On Thursday 24 October 1929 the market lost 11% 
of its value and a high volume was traded. The ticker tapes used back then could not keep 
up with this volume which created panic among investors as stock prices became unclear. 
An attempt to intervene (through agreements on specific purchases well above the current 
market prices) failed and so the rally continued on Friday 25 October 1929. On Monday 
28, the Dow fell by 1396, on Tue by 12%; together, over 30M USD were lost. The volume 
of stocks traded that day should remain a record for almost 40 years. After recovering 
for several months, the Dow slid again from April 1930 to 8 July 1932. The market only 
reached the peak closing of 3 September 1929 on 23 November 1954. 

The Wall Street Crash struck after a speculative boom in the late 1920s which led many 
Americans to invest heavily in the stock market, often from borrowed money (over 8.5B USD 
was out on loan, more than the entire amount of currency circulating in the United States). 
The rising stock prices encouraged more people to invest, an economic bubble was the 
result. 

In 1933, U.S. Congress passed the Glass—Steagall Act, a law imposing the separation of 
banks into commercial banks (for example taking deposits, providing loans) and investment 
banks (for example underwriting, issuing and distributing stocks, bonds and other securities). 
Another consequence of the Wall Street Crash of 1929 was that stock markets worldwide 
put measures into place to suspend trading in the event of rapid declines; this could not 
prevent the event of 19 October 1987, though. 


On Monday, 19 October 1987, known as ‘Black Monday’, stock markets around the world 
crashed, starting in the Hong Kong Stock Exchange, then Europe and later North America; 
the S&P 500 index fell by 20.47%, the Dow Jones Industrial Average (DJIA) by 22.61%, the 
largest one-day percentage decline in the indexes’ histories. Trading was stopped at many 
exchanges because the IT infrastructure was not able to deal with high order volumes at 
that time, among other things. This bought the Fed time to increase liquidity. Additionally, 
companies started to buy their own stocks, partially to stabilize their prices. It took the 
DJIA about 15 months to recover. New Zealand’s market was hit especially hard, falling 
about 60% from its 1987 peak, and would take several years to recover. 

The causes of the crash include program trading, overvaluation, illiquidity and market 
psychology, and are still debated. Most blame was taken by program trading strategies for 
blindly selling stocks as markets fell, exacerbating the decline. Some economists believe 
that the speculative boom leading up to October (the DJIA grew by almost 100% since 
1985) was (already) caused by program trading, and that the crash was merely a return to 
normalcy. Other causes included international disputes about foreign exchange and interest 
rates and fears about inflation (for the first time in three years, the Fed increased the prime 
rate for short-time loans). 

After Black Monday, new rules of regulation where developed, such as “trading curbs” also 
known as “circuit breakers” which allow exchanges to stop trading in cases of exceptionally 
large price declines in indexes such as the DJIA. 


Note. Numerous accounts have been written on this event, a good place to start reading is 


Case Studies 


Bernhardt and Eckblad (2013) and the references therein. You may also want to consult 
Wigmore (1998). The Black Monday crash even made it into book-format, see Henriques 
(2017). An interesting comparison to the crash of 2008 is McKeon and Netter (2009). 


A flash crash is a very fast, deep fall in security prices typically due to automated or 
high-frequency trading. The flash crash on 6 May 2010 was due to a 4.1B USD trade on the 
New York Stock Exchange which resulted in a loss of the DJIA of about 1000 points and 
the bounce back within about 15 min. Overall, the trillion dollar U.S. stock market crash 
took about 36 min. As an aftermath, new regulations were put in place, but they proved to 
be inadequate in the 24 August 2015 flash crash. 

The causes of the crash are still debated, some early theories (such as a fat-finger trade) 
have been falsified. On 21 April 2015, trader Navinder Singh Sarao was accused of using 
an automated program to generate large sell orders to push down prices (spoofing; now 
banned), which he then canceled to buy at the lower market prices. His company, Nav 
Sarao Futures Limited, allegedly made more than 40M USD in profit from trading during 
the flash crash. 


Note. A good paper on the subject is Kirilenko et al. (2017). An event of similar nature was 
Knight Capital, 1 August 2012, with a loss of 440M USD which caused its stock price to 


fall over 70%. The incidence happened when a technician forgot to update trading software. 


The event was described as “technology breakdown”. 


Solution 1.10 (Natural catastrophes) 


a) Kobe earthquake struck on 17 January 1995 in Japan, the tremors lasted for approximately 


20 seconds. Up to 6434 people lost their lives, about 4600 from Kobe. It was Japan’s worst 
earthquake in the 20th century (highest Level 7 seismic intensity on the scale of the Japan 
Meteorological Agency) after the Great Kanto earthquake from 1 September 1923 which 
claimed more than 105000 lives. Nearly 400000 buildings were irreparably damaged by 
Kobe earthquake, roads (in particular, images of the collapsed elevated Hanshin Expressway 
made front pages of newspapers worldwide), rail bridges and 120 of the 150 quays in the 
port of Kobe (then the world’s 6th largest container port). Around 300 fires were triggered 
by the earthquake. Disruptions of water, electricity and gas supplies were a result. Improved 
steel-reinforcement specifications were part of the revised building codes from the 1981, but 
only applied to new structures. Local hospitals struggled due to being cut off from supply 
roads; some people had to be operated on in waiting rooms and corridors. 

The size of the earthquake caused a major decline in Japanese stocks, with the Nikkei 225 
index plunging by over 8% on the day after the earthquake. As a consequence, Barings Bank 
collapsed; see Exercise 1.8 a). Despite all devastation, the local economy recovered quickly 
(within a year import volumes through the port had recovered; in less than 15 months 
after the earthquake, manufacturing activity was at 98% of its projected pre-earthquake 
level). Approximately 1.2M people volunteered to help in the first three months after the 
earthquake. 1995 is thus often regarded as a turning point in the emergence of volunteerism 
as a major form of civic engagement (in 1995, 17 January was declared a national “Disaster 
Prevention and Volunteerism Day” and the week from January 15 to 21 as “Disaster 
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Prevention and Volunteerism Week”). 

Numerous improvements to disaster prevention were the result of Kobe earthquake. The 
Ground Self-Defense Forces were given automatic authority to respond to earthquakes over 
a certain magnitude, control over fire response was centralized in Tokyo and Kyoto, special 
disaster prevention routes were determined and earthquake-proof shelters and supplies were 
built in public parks, to name a few. 


Hurricane Katrina hit land in Florida’s southwest in the morning of 25 August 2005. The 
storm weakened over land but regained hurricane status over the Gulf of Mexico. The storm 
quickly grew to a Category 5 hurricane after entering the Gulf with maximum sustained 
winds of 280 km/h in the morning of 28 August 2005. It made its second landfall on 29 
August 2005 near Buras-Triumph, Louisiana, and a third one near the Louisiana—Mississippi 
border as Category 3 hurricanes. Overall, 1836 people died (many more missing), damage 
to property was about 108B USD, insurance losses rose to 62.2M USD. Especially New 
Orleans was hit badly, two broken levees led to floods of 80% of the city, up to 7.6m deep. 
Drinking water was contaminated with flood water, about one million people were homeless, 
about five million had no electricity (with estimates of reinstatement work ranging up 
to two months). Riots and violence broke out over the last food and neighborhoods in 
New Orleans and there was an increasing risk of epidemics. Heavy rainfalls of a by then 
weakened hurricane Katrina reached as far north as Ontario and Québec in Canada. 

Evacuation of New Orleans would have been the mayor’s and governor’s responsibility, 
but they failed to deploy school buses ready for evacuating people, also local police was 
partly participating in the riots instead of being on duty. Also then president G. W. Bush 
was criticized for not providing enough help, after which he promised an emergency relief of 
10.5B USD. He also called the event a surprise although the Scientific American published 
an article in October 2001 which described this scenario. 

Hurricane Katrina also damaged or destroyed 30 oil rigs in a region where about 25% 
of the U.S/s oil production took place. This ultimately led to higher oil prices worldwide, 
U.S. gas prices increased by 50% to 200%, gas had to be rationed in the south. Hurricane 
Katrina also had large impacts on the environment, with damages to coastlines and nature 
reserves. Water pumped out of New Orleans was infested with bacteria, heavy metals and 
pesticides. Due to many people being resettled in Texas, the social structure of New Orleans 
changed significantly because of hurricane Katrina. 

See also Exercise 1.8 h) for a collapsing hedge fund related to hurricane Katrina. 


The flooding of the Chao Phraya river basin (Bangkok) starting at the end of July 2011 
was a consequence of persistent rainfall. Some key data are: 


" Flood duration: 158 days 

" Flood magnitude: 7.9 

= Return period: 10-20 years 

= Deaths: 815 (three missing) 

" Economic Loss: 30B USD 

= Expected Insured Loss: 12B USD 


Discussion 


As the world’s second-largest producer of hard disk drives, Thailand produces about 25% 
of the world’s production. As many of the factories in the Bangkok area were flooded, most 
hard disk prices almost doubled globally, which took about two years to recover. This is a 
prime example of business interruption risk which is considered as one of the, if not the, 
most important risk facing the (re)insurance industry; see the Allianz Risk Barometer 2018 
where cyber and business interruption top the list. 


Discussion 


Solution 1.11 (Risk and uncertainty) 

There are various definitions of risk, all have in common the chance of a loss, so the possibility 
of losing something of value. Although precise future outcomes are never known, risk typically 
refers to a predictable or quantifiable situation (for example probability and size of a loss known) 
whereas uncertainty involves imperfect or unknown information and is thus unpredictable or 
uncontrollable (for example when the probability of a loss is known, but not its size). 


Solution 1.12 (The Q in QRM) 

Going forward, quantitative techniques have to go (in a well-balanced way) hand in hand with 
qualitative considerations. You may want to discuss examples where one Q clearly has to 
dominate the other as well as vice versa. Also, look for examples where both Qs have to go 
hand in hand. Whichever, when discussing both the usefulness and criticism of ‘Quants on 
Wall street’ it pays to recall the 8 October 2008 Forbes interview given by Shreve (2008), where 
the following quote appeared: 


The quants know better than anyone how their models can fail. For banks, the only 
way to avoid a repetition of the current crisis is to measure and control all their 
risks, including the risk that their models give incorrect results. On the other hand, 
the surest way to repeat this disaster is to trust the models blindly while taking 
large-scale advantage of situations where they seem to provide trading strategies 
that would yield results too good to be true. Because this bridge will be rebuilt, 
the way out of our present dilemma is not to blame the quants. We must instead 
hire good ones — and listen to them. 


For a somewhat contrarian view, see Triana (2008). In this contribution, the following statement 
is made: 


One excruciatingly urgent lesson from this crisis is that we must develop unlimited 
skepticism and irreverence toward the application of mathematical models in the 
markets. We need to stop taking quantitative gadgets so seriously, and start to take 
the perilousness of quantitative gadgets more seriously. We can’t let this happen 
again. 


Perhaps the author of the latter paper should have reread first the key findings of Danielsson 
et al. (2001), especially the criticism on the use of value-at-risk and the excessive power of the 
rating agencies. 
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Ex. p. 10 Solution 1.13 (Trends in financial regulation) 


a) The idea that risk regulation has become too complex is often expressed by practitioners. 
A regulator who has expressed this view is A. G. Haldane of the Bank of England; see 
Haldane and Madouros (2012). One possible approach to this part of the exercise would be 
to discuss the extent to which you agree with the arguments in this article. To give the 
flavour we quote from the introduction: 


So what is the secret of the dog’s success? The answer, as in many other areas of 
complex decision-making, is simple. Or rather, it is to keep it simple. Studies 
have shown that the frisbee-catching dog follows the simplest of rules of thumb: 
run at a speed so that the angle of gaze to the frisbee remains roughly constant. 
Humans follow an identical rule of thumb. Catching a crisis, like catching a 
frisbee, is difficult. Doing so requires the regulator to weigh a complex array of 
financial and psychological factors, among them innovation and risk appetite. 
Were an economist to write down “crisis catching” as an optimal control problem, 
(s)he would probably have to ask a physicist for help. Yet despite this complexity, 
efforts to catch the crisis have continued to escalate. Casual empiricism reveals an 
ever-growing number of regulators, some with a doctorate in physics. Ever-larger 
litters have not, however, obviously improved watchdogs’ frisbee-catching abilities. 
No regulator had the foresight to predict the financial crisis, although some have 
since exhibited supernatural powers of hindsight. So what is the secret of the 
watchdogs’ failure? The answer is simple. Or rather, it is complexity. Haldane 
and Madouros (2012) explores why the type of complex regulation developed over 
recent decades might not just be costly and cumbersome but suboptimal for crisis 
control. In financial regulation, less may be more. 


And from the conclusion: 


Modern finance is complex, perhaps too complex. Regulation of modern finance 
is complex, almost certainly too complex. That configuration spells trouble 
[...]. Because complexity generates uncertainty, not risk, it requires a regulatory 
response grounded in simplicity, not complexity. Delivering that would require 
an about-turn from the regulatory community from the path followed for the 
better part of the past 50 years. If a once-in-a-lifetime crisis is not able to deliver 
that change, it is not clear what will. To ask today's regulators to save us from 
tomorrow's crisis using yesterday's toolbox is to ask a border collie to catch a 
frisbee by first applying Newton'xs Law of Gravity. 


b) Evidence of a movement away from internal models back to simpler standardized approaches 
can be found in operational risk modelling, for example, in the form of the advanced 
measurement approach (AMA) versus the simpler standardized approach. In BIS (2016b), 
the Bank for International Settlements advocates the use of a simpler standardized approach 
for operational risk modelling, pointing out the by now available data and corresponding 
improvement of the such approaches. Cohen (2016) agrees that the AMA depends on the 
choice of many somewhat arbitrarily chosen parameters which leads to a convoluted model 
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that is hard to understand and justify. Peters et al. (2016) argues against discarding the 
AMA, pointing out limitations of standardized approaches and proposing an improvement 
and standardization of the AMA instead. 


Solution 1.14 (Regulation and credit provision) Ex. p. 11 
A place to start is Admati and Hellwig (2014). This text takes a critical stand, favoring stronger 

regulation, especially at the level of leverage, and dismantles the banks’ views that “increased 

regulation strangles credit provision”. 


Solution 1.15 (Shadow banking and insurance) Ex. p. 11 


a) “Shadow banking” refers to a collection of non-bank financial intermediaries that provide 
services similar to traditional commercial banks but outside normal banking regulations. 
Former U.S. Federal Reserve Chair Ben Bernanke provided the following definition, including 
examples, see Bernanke (2013): 


Shadow banking, as usually defined, comprises a diverse set of institutions and 
markets that, collectively, carry out traditional banking functions — but do so 
outside, or in ways only loosely linked to, the traditional system of regulated 
depository institutions. Examples of important components of the shadow banking 
system include securitization vehicles, asset-backed commercial paper [ABCP] 
conduits, money market funds, markets for repurchase agreements, investment 
banks, and mortgage companies. 


b) An example for shadow insurance is if life insurers use reinsurance to transfer liabilities to 
less regulated and unrated off-balance-sheet entities within the same insurance group; see 
Koijen and Yogo (2016). For an interesting newspaper discussion of the shadow insurance 
case discussed in this paper, see Chicago Tribune (2016) and also Hepfer et al. (2016). 
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Solution 2.1 (Notions of capital) 


a) 


b) 


c) 


'The financial analyst attempts to value the assets and liabilities of the firm and hence is 
mainly concerned with the equity capital. 

The main concern of the chief risk officer is to take risk-mitigating action to prevent 
insolvency and hence (s)he is mainly concerned with the impact of a reinsurance treaty on 
the economic capital. 


The regulator follows the rules laid down in the regulatory capital framework. 


Solution 2.2 (Different notions of financial distress) 


a) 


c) 


Illiquidity means that a company is not able to raise the cash to make payments when they 
are due (also known as funding liquidity). This could occur if the company holds too many 
illiquid assets that cannot readily be converted to cash. 

Insolvency means that a company has negative equity, that is its assets are worth less 
than its liabilities. 

Default is the failure to meet contractual obligations, for example failure to make a 
scheduled interest payment on a loan. 

Bankruptcy refers to the legal process instigated when a firm cannot pay its debts. The 
result may be a restructuring of the debt allowing the firm to continue in business or an 
orderly liquididation of the firm's assets. 

A firm could have negative equity capital at a particular point in time (thus being insolvent) 
but might still be able to raise sufficient cash to meet any payment obligations. Such a 
situation is typically not sustainable in the long run. 

A firm could fail to pay back a short-term loan due to a lack of liquid assets even though the 
value of those assets exceeded the value of the firm's liabilities so that the firm is solvent. 


Solution 2.3 (Valuation of a real-estate investment) 


a) 


b) 


This corresponds most closely to book value (and would not generally be used for real 
estate valuation). 

'This corresponds to mark-to-model with objective inputs. Note that, it is not a mark-to- 
market valuation because the good (the flat) being valued is unique and there is no current 
market price available. 
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2 Solutions to Basic Concepts in Risk Management 


c) This corresponds to mark-to-model with subjective inputs (due to the ad hoc adjustment 
of the price). 


Solution 2.4 (Translation invariance of risk measures) 

For a loss L and a translation invariant risk measure 9 let | = o(L). Translation invariance 
implies that o(L—1) = o(L) — 1 = e(L) — o(L) = 0, so reducing the loss by l makes L acceptable. 
Reducing the loss by l corresponds to adding an amount of capital l. 


Solution 2.5 (Subadditivity of risk measures) 

Subadditivity is consistent with the concept that diversification reduces risk. Moreover it 
permits decentralized risk measurement by sub-units of a firm (for example to bound o(L1 + L3) 
by a constant c, it suffices to bound o(L;) by cj, j € (1,2), for c14-c» € c, since, by subadditivity, 
o(L1+ L2) € o(L1) + o(L3) € c-- co € c) and removes any incentive to split the firm to reduce 
capital requirements. 


Solution 2.6 (VaR and expected shortfall) 

a) VaRQ(L) = F€ (o) = inf[z € R: F(x) > a}, that is, VaRa(L) is the a-quantile of the loss 
distribution function F of L. 

For E(|L|) < oo (or the weaker requirement E(max{L,0}) < co), expected shortfall of L 
at confidence level o is given by ES,(L) = <4 f 1 VaR,(L) du. 

b) VaRA(L) exists for all distributions and is relatively easy to interpret. ES4(L) is subadditive 
(thus coherent) and takes loss severity into account (it depends on the whole tail of the loss 
distribution beyond VaR4(L), not just the a-quantile of the distribution function F of L). 
Note. Both risk measures are law invariant, monotone, translation invariant and positive- 
homogeneous. 


Solution 2.7 (Superadditivity scenarios for VaR) 
Superadditivity of VaRq often appears under the following scenarios, see MFE (2015, Sec- 
tion 2.3.5): 


" Independent losses with highly skewed distributions; see also Exercise 2.27. 

" [ndependent losses with continuous light-tailed distributions and a rather low confidence 
level o; see also Degen et al. (2007), McNeil and Smith (2012) and Exercise 2.26. 

" Losses with continuous symmetric distributions and special dependence structure; see also 
Embrechts et al. (2013b) and Exercise 8.21. 


m Independent losses with continuous, very heavy-tailed distributions; see also Exercise 2.28. 


Solution 2.8 (Additivity for two linearly dependent random variables) 

Translation invariance and positive-homogeneity of VaR imply that VaR4(.X +Y) = VaRa(X + 
aX +b) = VaRS((14- a).X +b) = (1 +a) VaRa(X) +b. Again by translation invariance and 
positive-homogeneity, VaR4( X) + VaRa(Y) = VaRa(X) +a VaRa(X) +b = (14- a) VaRo(X)+ 
b = VaRa(X + Y), so VaR, is additive. 


110 


Basic 


Basic 


Solution 2.9 (Risk-neutral valuation for interest-rate derivatives) 

a) p(0,1) = 1/(1 + ro) = 1/1.015 z 0.9852. If rı = 0.01, then p(1,2) = 1/(1 + r1) =1/1.01 e 
0.9901 and if rı = 0.02, then p(1,2) = 1/(1 + r1) = 1/1.02 © 0.9804. 

b) At t= 1 the value of the long bond is equal to p(1,2), so q = Q(ri = 0.01) solves 


0.969729 = p(0.1)( ae er oe ) 
l SPA TSEOU V 740.02)" 


This leads to the solution q = 0.4. 
c) The risk-neutral price of the floor contract is given by Vo = p(0,1)(q-1+(1—q)-0) e 0.3941. 


Solution 2.10 (Mapping of a stock portfolio affected by exchange rates) 
a) The value V, of P at time t in EUR is 


Vi = Si + A2Staet HF = Ay exp(Zi1) + Aa exp(Zi2 + Ze). 


The mapping f(t, z) satisfying V; = f(t, Z+) is thus f(t, z) = A1 exp(z1) + A2 exp(za + za). 
b) Let Xi41 = Z41 — Z denote the vector of risk factor changes with components X,,4,; = 
Zi4ij — Zuj, J € {1, 2,3}. The value V4 of P at time t + 1 in EUR is then 


Viii = f(t 1, Zi) = f(t 1, Zi + Xia) 
= A1exp(Zri + Xon) + Azexp(Zta + Xt41,2 Zi + Xo). 


The one-period loss L;,1 = —(Vi41 — V) is thus 


Li = —(A1 exp( Ze. + Xt41,1) + Azexp(Zrao + Xt41,2 + Zia + Xia) 
— A, exp(Z4,1) — A2 exp( Z: 2 + Zua)) 
= -(X exp(Z¢,1)(exp(X¢41,1) — 1) 
+ A2 exp(Zi,2 + Zia) (exp(Xcci2 + Xe) — 1)). 


c) The linearized one-period loss is given by 


3 
Lay =- (su Zi) t+ fat, Z)Xui;) 


j=l 
= —(A1 exp( Z1) Xt411 + A2 exp(Zia + Zt,3)(Xt+1,2 + Xt41,3)) 
= — (Sti Xcag + A2Sgaet ™ (X13 + Xt41,3))- 


With the portfolio weights w; = A1$11/V; and wi» = A2, 3e HF / Vi, we have 


LÀ, = -Vi(wii Xa + wea(Xeiia + Xt41,3))- 
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Solution 2.11 (Properties of VaR and expected shortfall) 

a) Monotonicity: Let Lı ~ Fj and Lə ~ F> be two random variables such that Lj < Lə 
almost surely. Then Fj(x) = P(Lj € x) > P(Lo € x) = Fo(x) for all x € R and thus 
VaRo(Li) = Ff (a) € F$ (a) = VaRa(L2) for all a € [0, 1]. 

Note. To convince yourself that the last inequality is correct, sketch Fi, F5. 
Translation invariance: Let L ~ F and l € R. Then Fr,;(z) 2 P(L 1x x) 2 F(x—l), 
so Ff (a) = F* (a) +1, and thus VaRa(L +1) = VaRa(L) +1 for all o € [0, 1]. 
Positive-homogeneity: Let L ~ F and A > 0. Then Fjr(x) = P(AL € x) = F(x/X), so 
Fy, (a) = AF* (a), and thus VaRq(AL) = A VaRa(L) for all a € [0,1]. 


b) Since integrals are monotone and linear. 


Solution 2.12 (VaR and ES for continuous distributions with finite mean) 

a) For an Exp(A) distribution, F(x) = 1 — exp(—Ax), x > 0. Solving F(x) = a with respect 
to x, we obtain that VaR,(L) = F* (a) = —log(1 — o)/A. Since flogv = vlog v — v, we 
obtain that 


ESS(E)- z [ P= xm [osa = 0) au 


0 
1— 


1 [vlog v — v] 
— => l = a aa 
CER NY LE d Au - a) 


By l'Hópital's rule (case ‘0/0’), lim,.,o, vlogv = 0 which implies that ES,(Z) = (1— 
log(1 — o))/A. 


Alternatively, since the exponential distribution is continuous we can apply MFE (2015, 
Lemma 2.13): 


1 : 1 oo =e 1—log(l—a 
ES4(L) = E(LItr»vaR,) = ent ee A dx = e 


b) Note that L = exp(u+oX) for X ~ N(0,1). Hence, F(x) = P(exp(u+oX) € x) = P(X < 
(log(z) — u)/o) = 9 ((log(z) — u)/c) and thus 


VaR4(L) = F* (a) -exp(u--o9 !(o), a€ (0,1), ER, o » 0. 


This implies that 


1 1 e" 1 EST e" ge 
ESq(L) = FO 2 i ob (u) = i - 
(L) Pc Jj (u) du pec c du E pere m e°* (x) dx 
oo c? [2 oo E 
ca ii i Ale pape NER call f l -(e-0)?/2 gy 
1— a Je-1(a) VZT l—o Je-1(o) V2m 
eto? /2 poo 1 2 elite? /2 poo 
= ES fag Vay = | d 
y-z-c l-a h 1(a)-o Von. oe ®-1(a)—o 9t) dy 
eite? [2 -— aug" ae 
ioa dBA) — 0) mua Gow BO - 9 (9) 
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c) Solving F(x) = 1—-exp(—,/z/10) = u with respect to x leads to F€ (u) = (—10log(1—u))? = 
100(log(1 — u))? and thus VaRo,.99(L) = 100(log(1 — 0.99))? ~ 2120.7592. For ESo.99(L), 
substituting z = log(1 — u) (so u= 1 — exp(x)), one obtains that 

100 t 

1 — 0.99 Jo.99 


log 0. Os 


ESo.99(L) = (log(1 — u))? du = 10 000 f" x’ exp ada. 


We can use integration by parts twice to obtain 


log 0.01 5 ? log 0.01 log 0.01 log 0.01 
r^*exprdz-— |x* expr —2| |xexpx — expzdz 
p p p p 
—00 — OO — oo 


—oo0 


.. (log0.01)? (98001 1 ) .. (log 0.01)? — 2 log(0.01) + 2 
|. . 100 100 100/ 100 


and thus ESo.99(L) = 100 i d 01)? — 21og(0.01) + 2) ~ 3241.7933. 
d) Solving F(x m 1 — (14- z)-? = u with respect to x, we obtain F*-(u) = (1— u)-V? — 1, so 
VaR4(L) = F^ (o) = (1— o)-V/? — 1 for a € [0, 1). It follows that, for o € [0, 1), 
1 n 1 1 
E -1/0 . = T 1— uyt du— 1 
Saf rz] 0- 1)du esc ( u) u 
1 (hs u)- 1/041 a - pp Uer ei ay uv 
l-a 5-1 " (1 — o)(1 — 1/0) 0—1 
. 0(VaRa(L) +1) ix 0 VaR4(L) +1 
i 0-1 i 0-1 
These calculations only make sense for 0 > 1, otherwise we would have an infinite-mean 
model. 
Solution 2.13 (VaR and ES for a distribution function with jumps) Ex. p. 15 


a) Figure S.2.1 shows the graph of F. 
b) Note that for all u 2 F(3—) — 3/4, we have that 


B u € (3/4,8/9], 
l/yl—u, we (8/9,1]. 
Since a = 0.85 satisfies a € (3/4, 8/9], we obtain that VaRo.35(L) = Ff (0.85) = 3. For 


a = 0.95 > 8/9 we have that VaRogs(L) = F* (0.95) = 1/,/1— 0.95 = V20 = 2/5 = 
4.4721. 


c) For all a € (F(3—), F(3)] = (3/4,8/9], we have 


mah tuo (fae [ae Venu D 


(3(8/9 — a) + [-2/1—- ulk) = 8/3 —3a+2/3 _ 10/3 —3o 


l-a l-a 


F-(u)- 


ESa( 


=] —a 
and thus ESo.35(L) = 47/9 ~ 5.2222. 
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1.0 


_ 


F(x) 
0.6 
l 


0.4 


0.2 


Figure S.2.1 Graph of F as defined in (E1). 


Ex. p. 15 Solution 2.14 (VaR for a binomial model of a stock price) 
The binomial tree in Figure $.2.2 shows that the loss after two years is Li,» = —8.16 with 


0.8 $142 = 108.16) => Lise = —8.16 
$i = 104 
Qo = = 
$2 — 99.84 => L5 = 0.16 
03 $2 — 99.84 => L5 = 0.16 
0.2 Sito = 92.16) => Lise = 7.84 


Figure S.2.2 Binomial tree for the stock price with S; = 100. 


probability 0.8? = 0.64, Li,2 = 0.16 with probability 2 - 0.2 - 0.8 = 0.32 and Li+2 = 7.84 with 
probability 0.2? = 0.04. Therefore, the loss distribution F Lise Of Liza is 


0,  z€ (—oo,—8.16), 


F, (æ) o 1095. æ € [7816,010) 
PI = 
"hs 0.96, z € [0.16, 7.84), 
1, x € [7.84, co), 
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with corresponding quantile function 


—816, u € (0,0.64], 
Fi,,.(u) = 40.16, u € (0.64,0.96], 
7.84, u€ (0.96, 1]. 


We thus have VaRo.z(Li42) = VaRo.95(Lt+42) = VaR 0.96 (Lt+2) = 0.16 and VaRo.99(L44.3) = 
7.84. 


Solution 2.15 (VaR and ES for a discrete distribution) 

Note that losses are profits with negative sign. The business line A thus faces a loss of -1000 with 
probability 0.82 + 0.04 + 0.04 = 0.9 (under w1, w2 and ws), 0 with probability 0.04 + 0.02 = 0.06 
(under ws and w4) and 10000 with probability 0.04 (under wg). Therefore, the loss distribution 
function Fa of business line A is given by 


0, x € (—co, 1000), 


Fala) = 409 © [-1000, 0) 

d — 

0.96, x € [0, 10000), 
1, x € [10 000, oo). 


The loss distribution function FR of business line B is the same as FA. 


a) Since Fa = Fg =: F, it suffices to compute VaR4(L) and ES,(L) for L ~ F and a = 0.95. 


Note that, VaRo.gs(L) = 0 (as 0 is the smallest number where F is at least as large as 
0.95). For ESo.95(L) = a lee F*(u) du, note that F* (u) = 0 for u € [0.95, 0.96] and 
F* (u) = 10000 for u € (0.96, 1]. Hence 


0.96 1 
ES ss(Ày = isl Odu+ | 10000du) = 20 - 10000(1 — 0.96) = 8000. 
1 — 0.95 \ Jo.95 0.96 
b) Let La ~ Fa, Lg ~ Fp and let FA4p denote the distribution function of the aggregated 
loss La + Lp. The aggregated loss La + Lpg is -2000 with probability 0.82 (under w1), -1000 
with probability 0.04 + 0.04 = 0.08 (under wz and wa), 0 with probability 0.02 (under w4) 
and 9000 with probability 0.04 + 0.04 = 0.08 (under ws and we). Therefore, 


0, x € (—oo, 2000), 


0.82, x € [-2000, —1000), 
Fayn(x) 240.9, x € [-1000,0), 

0.92, æ € [0, 9000), 

Í; x € [9000, oo). 


Then VaRo.95(La + Lg) = 9000. Furthermore, since VaR,(La + Lg) = 9000 for all 
u € [0.95,1], 


1 


1 
ESoo5(La + LB) = 7—75 J, gq 2000 du = 9000. 
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As we see from the above computations, VaRo.95(La + Lg) = 9000 > 0+ 0 = VaRo.95(La) + 
VaRo.95(Lp) (value-at-risk is not subadditive in general) and ESo.95(La + Lg) = 9000 < 
8000 + 8000 = ESo.95(La) + ESo.95(Lp) (expected shortfall is subadditive). 


Solution 2.16 (Expected shortfall for t distributions) 


Po — T((v41)/2) 
Let L ~ tp. For c, = TTUT” 


falt) =c(l+22/v) F, zER, 


denotes the density of t,. Substituting x = tz! (u) (so dx = 1/f;,(t7'(u)) du and thus 
fa (x) dv = du) leads to 


= ——~ hito (09) t; (0? /v). 


These calculations only make sense for v > 1, otherwise the mean of L does not exist. 


Note. By linearity of the integral, 


ESa(L) = p + o fi (t7 (0) + tz (0)*/v) 


is the expected shortfall of L ~ t, (u, a?). 


Solution 2.17 (VaR and ES for bivariate normal risks) 

The annualized volatility c4 is the standard deviation of an instrument’s yearly log-returns. 
According to convention (and motivated by the square-root-of-time rule), the daily volatility og 
is obtained by taking oa/v250. The daily volatilities of the log-returns X44, = (Xt41,1, Xt41,2) 
at day t + 1 of the two stocks are thus oa = 01/V250 = 0.2/4/250 and eq» = 02/V250 = 
0.25/4/250, respectively. Therefore, their covariance matrix is 


, 04 0.40.2. 0.25 
E es Odi POd10d2\ — 250 "vmsoy»50 | | 1 (16 8 
7 UH C \poaiga2 043 ~ 10.4-92. 025 0.0625 10948 25]. 
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We thus have that X41 = (X+¢41,1, Xt41,2) ~ N2(0, X). As in Solution 2.10, the daily one-period 
linearized loss random variable is 


Lea = —Vi(wi Xia + weXt41,2) = -Vw Xi 
~ N(-Ww'0, (V)? w Ew) = N(0, V2w'YXw), 


where w = (w1, w2) = (0.7, 0.3) are the given portfolio weights. With V; = 10° we thus have 
that 


VaRo.s9( LÂ ,) = 0+ V2w!/Yw 971 (0.99) ~ 26 979.62, 
ESo.99 (LA ,) = 0 + Y V2w £w ó(9-!(0.99)) /(1 — 0.99) = 30 909.59, 


where the numbers are interpreted in EUR. 
Redoing the calculations with p = 0.6 leads to the (larger) values VaRo.s9( L61) = 28 615.02 
and ESo.99(L.1) = 32 783.22. 


Solution 2.18 (Basic historical simulation) 

a) The value of the portfolio today (in EUR) is V, = 25, = 2000. In terms of the risk factor 
Zi = log $4 and the one-month risk-factor change X+41,1 = log(St+1,1/ 94,1) = Zt+1,1— Žt,1, 
the one-month linearized loss is given by 


Er = —ViXt+1,1 = —2000.X7441,1- 


Based on the given last n = 10 realizations of X++1,1, the last n realizations L2, ,4,..., LÂ 
of Lea are thus equal to 322, —102, 8, 50, 80, —210, —104, 58, —382, —8; recall that 
the values in Table E.2.1 are in 96. An estimate of VaRo.9(LA.;) based on the empirical 
distribution function of L2, ,4,..., LA is the [na] = [10 - 0.9] = 9th smallest (or second 
largest) loss, so 80. 

b) The value of the portfolio today (in EUR) is V; = $,14-105,» = 2000. The portfolio weights 
are w1 = Si /V; = 0.5 and w2 = 105,5/V; = 0.5. In terms of the one-month risk-factor 
changes X;,1,; = log(St+1,j/St,j) the linearized loss is thus given by 


Dos = —Vi(w1Xt41,1 + woX141,2) = —2000(0.5.Xt411 + 0.5.X 1.112). 


Based on the given last 10 log-returns, the last 10 realizations L2.,,;,..., LÂ of the 
linearized loss L5 are 243, —82, 0, 40, 70, —150, —72, 66, —300, 0. An estimate of 
VaRo.9(LA.,) based on the empirical distribution function of L2 , | ,,..., L2 is the [na] = 
[10 - 0.9] = 9th smallest (or second largest) loss, so 70. 


Solution 2.19 (Axioms of coherence for standard deviation) 
First note that o(L) is well-defined on M. We now consider the axioms one by one. 


m Monotonicity does not hold: For a counterexample take a random variable Lı ~ U(0, 1) 
and let Lə = 0.5L4 + 1, so that Lə ~ U(1,3/2). Then Lı € Lə almost surely but o(L1) = 
/A/12 > \/1/48 = o(L2); recall that U(a,5) has variance (b — a)?/12. 
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= Translation invariance does not hold: Take any L € L? and! 4 0. Then o(L +1) = 
yvar(L +1) = vvar L = e(L) 4 e(L) + l. 
= Subadditivity. This axiom holds. For all L4, Lə € LŽ, 
var(L4 + L3) = var(L4) + var(L2) + 2 corr( L1, L2) var Lı var Le 
< var(L) + var(L2) + 2\/var Lı vvar Lo = (var Lı + vvar L2)’, 


so that 


o(L, + L3) = 4/ var(L4 + L3) € vyvar L4 + vvar La = o(L1) + o(L2). 


= Positive-homogeneity: This axiom holds since for all L € L? and every A > 0 we have that 


o(AL) = J/var(AL) = yX var(L) = Avvar L = Ao(L). 


Solution 2.20 (Superadditivity of VaR for two iid Bernoulli random variables) 


ind. 


If Li, Lo ~ F with 


0, x <0, 
F(r)—-41-p, ve (0,1), 
1, y], 


—oo0, u=)0, 
F (u) = 0, x € (0,1 — p], 
1,  xe(l-p,1 


Since L; € {0,1}, 7 € {1,2}, we know that Lı + Lo € {0,1,2}, and the corresponding 
probabilities are P(Li + Lə = 0) = P(Li =0, Ig = 0) = (1 — py^, P(Li + Lo = 1) = P(Li = 
1, Lg = 0)+P(L1 =0, Le = 1) = 2p(1— p) and P(L1+L2 = 2) = p°. Hence, L14- Lo ~ Flitt 
for 


0, xz <0, 
(Lap. z € [0,1), 
Prior) = 2 2 
(1—-p) -2p(1—p)—1—-p, z€[L2) 
1, £ > 2, 
with corresponding quantile function 
—oo, u=0, 
0; uwe(0(1-pr] 
Ffi +r (u) = 2 2 
1, u € ((1— p)*,1 — p°], 
2, utc (1 - p, 1] 


Consider the following cases: 
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= For a € (0,(1 — p)?|, we obtain that VaRa(L4 + L3) = 0 = 0 + 0 = VaRq(L1) + VaRa(L2), 
so that VaR is (sub)cadditive. 

= Foro € ((1— p)?,1— p] (note that 1 — p < 1 — p?), we obtain that VaRa(Lı + L2) = 1 > 
0 +0 = VaR4(L1) + VaRa(L2), so that VaR. is superadditive. 

= Foro € (1— p,1— p°] (note that 1 — p > (1— p)?), we obtain that VaRa(L1 + L2) —1 < 
1+1 = VaRa4(L1) + VaRa(L2), so that VaRq is subadditive. 

= Foro € (1— p?,1], we have that VaRa(Lı + L3) = 2 = 1 + 1 = VaRa(L1) + VaRo(L2), so 
that VaR, is (sub)additive. 

Therefore VaRa, a € (0, 1), is superadditive if and only if a € ((1 — p)?, 1— p]. 


Note. À more general result can be found in Exercise 2.27 and, even more general, in Hofert 
and McNeil (2014). 


Solution 2.21 (VaR under strictly increasing transformations of a loss) 

Let a € (0,1). Since h is continuous and strictly increasing, the inverse h^! exists and is also 
continuous and strictly increasing. Let Fr and Fh(z) denote the distribution functions of L 
and h(L), respectively. Then Fhz)(x) = P(h(L) € x) = P(L < h-l(a)) = Fr(h-!(x)) and 
thus, by solving Fr(h-!(x)) = o with respect to x, we obtain that VaR4(h(L)) = Fray (2) = 
h(Fr (o)) = h(VaRQ(L)). 

Note. A more general result can be found in MFE (2015, Proposition A.5). 


Solution 2.22 (Subadditivity of VaR for bivariate normal random variables) 
a) Since Lj ~ N(n, 03), VaRa(L;) = uj + o;9  ! (a), j € {1,2}, where © denotes the distri- 
bution function of N(0, 1). Furthermore, Lı + La = (1,1) (7) ~ N(Q.1) (2). (LEQ), 


where 
2 
y= | 01 iu 
P0102 095 
denotes the covariance matrix of (L1, La). Therefore, L44- Lo ~ N(qq 4-2, o? +2p0102 + o2) 


and thus VaRa(Lı + L2) = pi + ua + yet + 2p0103 + 029 (a). 
b) For a € [1/2,1), ®-!(a) > 0 and thus, by a), 


VaRa(L1 + L2) = ui + u2 + yo? + 2p0102 + 0367 (o) 


< uc u2 + yo? + 20102 + c29 (a) 


= pu + uad 4/ (01 + 03)?9- 1(o) = W + 219 (a) + pig + 239 l(o) 
= VaRq(L1) + VaRa(L2). 


Note. For a € (0,1/2), ®-!(a) < 0, so the inequality holds in the other direction and thus 
VaR, is superadditive. See Exercise 8.4 for a more general case. 
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VaR,(L;+L2), p=-1 
VaRa(Lı +L2), p = -0.5 
VaR,(L;+L,), p=0 
VaR,(L, + L2), p=0.5 
VaRa(Lı 4 L2), p=1 
-- VaR(Li) * VaR«(L2) 


1 
LIII 


VaRa 


Superadditivity : Subadditivity 
region D region 


Jointly normal (Li, L2) with N(0,1) margins and correlation p 


Figure S.2.3 a > VaRa(Lı + L2) for p € (—1, —0.5,0,0.5,1]. 


c) Figure $.2.3 shows the graphs. The figure indicates that the mapping p — VaRa(Lı + L2) 
is increasing in p for o € [1/2, 1) (and decreasing for o € (0,1/2); this can be proved 
analytically using the fact that VaRa(Lı + L3) = J/2(14- ®t (a). For p = 1, VaRa(Lı + 
L3) and VaR4(L) + VaRa(L2) coincide which is what was already shown analytically 
in Exercise 2.8. Furthermore, note that for p = —1, the distribution of Lı + Lə is the 
(degenerate) unit jump at 0 (L4 + Lə ~ N(0,0)) and thus VaRa(Lı + L2) is equal to 0 for 
all a € (0, 1). 


Ex. p. 17 Solution 2.23 (Shortfall-to-quantile ratio for exponential, normal and Pareto) 


a) It follows from Exercise 2.12 a) that VaR4(L) = —log(1— o)/A and ES,(L) = (1 — log(1 — 
a))/A = —(log(1 — o) — 1)/4. We thus obtain that 


. ESa(L) .. log(l—a)-1 . 1 
lim ————— —————-—] ]— ———__} = 1. 
ace VaRa(L) als log(1 — a) Jim ( log(1— 2) 
b) Using the fact that for a standard normal distribution ES,(L) = Ce) 
substitution z = $^ !(a) we get 


and making the 


ESQ(L) _ (x)/x 


la c9 
otl- VaRg(L) #220 1 — P(r) 


provided the latter limit exists. By l'Hópital's rule (case ‘0/0’) and the fact that o'(x) = 
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—ró(xr) we obtain that 


and thus that 


im ESa(L)_ —1 
o-i- VaRq(L) Da 


c) It follows from Exercise 2.12 d) that VaR4(L) = (1 — a)-!/? — 1 and ES,(L) = GVaRa(h) 41 | 
We thus obtain that 


ESa(L) _ pp O+1/VaRa(L) _ _ 8 


QUU vena et 0—1 ep 
The result shows that for light-tailed distributions (such as the exponential or normal) value-at- 
risk and expected shortfall are asymptotically equivalent, while for heavier-tailed distributions 
(such as the Pareto) the expected shortfall is asymptotically a multiple of value-at-risk; note 
that for the Pareto case, the difference between the two risk measures can be made arbitrarily 
large by letting 0 — 1—. A more general result can be found in MFE (2015, Section 5.2.3). 


Solution 2.24 (Matching VaR and ES) Ex. p. 17 


a) Let L ~ N(0,1) and note that, in distribution, L = u + cL. By translation invariance 
and positive-homogeneity, VaRo.99(L) = u + o VaRo.g9(L) and ES,(L) = u + e ES,(L), 
so ES,(L) = VaRo.99(L) if and only if ES,(L) = VaRo.g9(L). Solving $-1(0.99) = 
VaRo.99(L) = ESa (L) = $9 (9) with respect to a, one obtains approximately a = 0.9742. 


1—-o 
b) Solving 


t;1(0.99) = VaRo.99(L) = ES,(L) 


_ tz (o) (= eat) 


l-a v—1 


with respect to œ when v = 3.5, we obtain approximately o = 0.9683. 


Solution 2.25 (Matching VaR and ES for Pareto distributions) Ex. p. 17 
a) We have 
VaRg(L) =ESa(L) = (1- py? -1- 3 4 teorie 
0—1 isay fay 
= T sey (E= 
etum) tena) 


4< a=1- = -1/8) 
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Since a needs to be in (0,1), we require 
1-(1-8)1—1/0)* € (0,1) += 1—8 e€ (0, (1-1/6)°) 
= £€ (1= (1=1/6)’, 1). 


Note that (1 — 1/0)? € (0,1/e) and the term is strictly increasing in 0 > 1. Therefore, the 
larger the 0 > 1, the larger the range of 8 for which there exists an a € (0,1) such that 
ES4(L) = VaRg(L). 
Let 6 € (1 — (1 — 1/0)*, 1) and define 

o(0) 21— (1— 8)(1 — 1/0) *^ = 1— (1 — B)exp(—0log(1— 1/0), 61. 


Then, 


o'(@) -1—(1— B) — 1/6) (-1 leet e a C38) 
-1- (1 8 - 1/6)" (- log(1 — 1/6) - 1/(6 — 1 


) 
2144 E ESL 1)log(1— 1/0) +1). (81) 


Since 1 — 8 € (0, (1 — 1/0)?) and 0 > 1, the fraction in (S1) is greater than 0. Therefore, 
a'/(80) > 0 if (0 — 1)log(1— 1/0) +1 > 0 for all 0 > 1. Since 0 — 1 and log(1 — 1/0) are 
strictly increasing, but the latter is negative, we obtain that (0 — 1) log(1 — 1/0) is strictly 
decreasing in 0. We are thus done if we can show that lime. ,4,(0 — 1) log(1 — 1/0) > —1. 
This follows from l'Hópital's rule (case ‘—co/00’) via 


<< log(l-1/0) te 
D upper AS 


The smaller the parameter 0 > 1, the more heavy-tailed the Pareto distribution and thus 
the smaller the confidence level o needs to be in order for ES, to match VaRg; note that 
this is in line with the result obtained in Exercise 2.23 c). Thus a = a(@) needs to be a 
decreasing function for decreasing 0 > 1, or, equivalently and as we showed, an increasing 
function of 0 » 1. 


Advanced 


Solution 2.26 (Superadditivity of VaR for two iid exponential random variables) 


a) 


VaR, is superadditive if and only if MS (a) = VaRa(Li-- L2) > VaRa(L1)4- VaRa(L2) = 
F$ (a) + F$, (a) which is equivalent to 


a > Fo us (Fi; (a) + Fi; (9). 
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b) Since Ly, Lo ^ Exp(A), A > 0, the density f,,+41, of the distribution function Fy,+1, of 
1 2 


L4 + Lə is given by 
fuse) = | fafale- ydys f X exp(—dy — Az — 9) dy 
= a exp(—Az) dy = Mzexp(-Az), z>0, 
0 


where fz,, fr, denote the (equal) densities of L1, L5, respectively. Hence, integration by 
parts leads to 


Fr, px )- f fis) )dz2 X n zexp(—Az) dz 
ss = xf E ) 
=À ([ x P Aj] + ee exp(—Az) dz 


Sort $a exp Ax) sg (exp Ax) 1) 


= —Azexp(—-Ax)--1-exp(-Ax) = 1 — exp(-Ax)(l-c- Az), rz 20. 


Together with the fact that Fý (o) = Fj,(o) = —log(1 — o)/A, the condition in a) is 
equivalent to 
a>1- exp(-A(-2 log(1 — a))/A)(1 + \(—2 log(1 — a)/A)) 
=1=(1=@)7(1=2log(1 = a)). 
Therefore, VaR is superadditive if and only if (1 — a)(1 — 21og(1— a)) > 1 (independently 
of A). 


Consider the auxiliary function h(a) = (1 — o)(1 — 21og(1 — a)) — 1, which is positive 
if and only if VaR, is superadditive. Note that h(0) = 0 and, by l'Hópital's rule (case 


‘oo /00’), h(1) = —1. By standard calculus arguments it can be shown that h has a unique 


root ag € (0,1) and h(a) < 0 for a > ap. Therefore, VaR, is superadditive if and only 
if a € (0,09). Numerically determining the root ag of h in (0,1) leads to approximately 
ag = 0.7153. 


Solution 2.27 (Superadditivity of VaR for iid Bernoulli random variables) 
Since y» b Y; ~ B(d, p), VaR is superadditive if and only if 


d 
Fý (a) = VaRa (2%) > M VaRa(Y;) = d VaRa (Y1) = dF¥, (a), 


where Fn denotes the distribution function of B(n, p). This is equivalent to 


Fap(dFýp(a)) < o5 
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use the negation of MFE (2015, Proposition A.3 (iv)) to see this. Since 


—œ, a=J, 
FY, (a) = 4 0, a € (0,1 — p], 
1, QE (1 — p, 1], 
we have that 
0, a — 0, 
Fay(dFi,(0) = 4(1—p)5, o€(0,1— p], 
1, a € (1— p, 1] 


In the first case (a = 0), Fa, (dFi,(a)) < a is not possible. The same applies to the last 
case (a € (1 — p,1]), so VaR, has to be subadditive in these cases. In the second case 
(a € (0,1 — p]), Fa" (dFt,(o)) < a happens if and only if (1— p)? < a. We thus obtain that 


VaR, is superadditive if and only if (1 — p)? < a for 0 < o < 1 — p and hence the claim. 


Note. From a risk management perspective, one is interested in large a. If a > 1 — p, one is on 
the safe side as VaR is subadditive for all dimensions d. Otherwise, VaRq is superadditive if 
and only if a € ((1 — p)7,1 — p| which is especially large for large d. 

This exercise shows how MFE (2015, Example 2.25) can be extended to the case of more 
than two bonds, where the random variables Y; represent the default indicators of the bonds; 
see Hofert and McNeil (2014) for an extension to dependent random variables Y;. 


Solution 2.28 (Superadditivity of VaR for a heavy tailed distribution) 
We first compute the distribution function Fr,,r, of Li + Lo. Note that F has density 


f(x) = 1/(223?), x € [1,00). Since L4, Lo are independent, the density fr, r,(x) of Ly + La 
is given by 


z—1 
Sita) =| fier t) dt = = T E (x =4) “aad ier. TEL 3/2 dt. 


Completing the square and then substituting s = t — z/2, we obtain 


z/2—1 1 
frels) - , id py” B 1-2/2 (2-9? di 


Substituting s = 5 sint (t = arcsin(2s/x); ds = $ cost dt) leads to 


qp paresin(1—2/z) cos t 1 arcsin(1-2/z) 1 
fiy+to(2) = 5 2 gj e al . 2 
8 Jarcsin (2/z—1) (za — sin? t)) X^ Jarcsin(2/x—1) COS^Í 
1 arcsin(1—2/xz) 
B x2 [tan LR 
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Note that 
: : sin arcsin x x x 
an arcsin z = : = = . 
cos arcsin z y1 — sin? (arcsin 2) V1 — x2 
Hence, 
1 1— 2/z 2/z —1 2 1-2/z 
fra+L (£) = 2 z 2] 72 2 
a\/[—-(—2/e2  J1- Q/s- 1) a? J/1— (1 — 2/z) 
E 1— 2/z 1 1-2/y» — 1-2/2 
| z2J1—(1—4/zr-c-4/a?) a?Jl/m—l/a32 syr- 
1T—2 
= , & €[2,00). 
x2./x —1 | ) 


Check that fr,4r,(x) equals Fr. 47, (x) for Fr, 4 p,(z) = 1 — 2v2 — 1/z, x € [2,00). 
To determine VaRa(Lı + L2) = Ff pz, (&) we have to solve Fr, p,(z) = a with respect to z, 


which happens if and only if 1 — a = 2 — 1/x and thus if and only if /x — 1/xz = (1 —a)/2. 


Multiplying by x, taking squares and reordering the terms, we obtain 


This quadratic equation has the solutions 


(via =@2 — ey Ca, 
Tac Gage odes 


Note that the solution has to satisfy z15» > 2, which happens if and only if (1 — a)? < 
1+ 1- (1— a)?. Since yz > x for all x € (0, 1), 


1- /1-(1-o)? 1-(1-(1— 0)?) 
Co ^77 Gap 


which is not a valid solution. We thus obtain that 


—2 


di; 2 


Val Ur aiio tt sf - = ap ^n ES SAR VaR CES) 


for all a € (0,1). 


Solution 2.29 (Median shortfall) 

a) Since Fy, is continuous, P(L > Ff (a)) = P(L > F$ (o)) = P(FL(L) > a) = 1- P(Fr(L) € 
a) — 1 — o > 0 for all a € (0,1) and thus 

P(Ff (a) < L< zx) 


SE E > FE (a). 
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Since FT, is continuous we can simplify the last expression to 


I 

V 
E 
& 


FritsFe(a)(2) 
b) By a), we have that 


MS,.(L) = Fors FE (a) (1/2) = inf(x ER: Frits Fe (a) (2) > 1/2} 


=inf{x eR: a => 1/2} 


=inf{xeR: Fry(x )> (1+ a)/2}= VaR iża (L ). 


Hence, MS4(L) is just value-at-risk at a higher confidence level. 
c) Since Fr(x) = 1— (1 +x)’, x > 0, we obtain that VaRa(L) = Ff (o) = (1—a)-V? —1 
and so MS,(L) = (1 — (1 + a)/2)-V/? — 1 = 2° (1 — q)-1/? — 1. Furthermore, 


1 Loa 1/64131 
BS dye f a-du CEN E 1 


1—oJa l-a —1/0+1 ja 
H (1 — a)- ien TH oa — a)- 1/9 i 
~ (1-a) (1 -— 1/0) = 8-1 l 


By l'Hópital's rule (case *oo/oo"), 


0(1—o)-1/9 l-a 
FSO) MEE c as Se e ó 


Mx MS,(L) A im 21/9(1 = a)-1/0 —1 = um Ad = a)~V/o-1 ~ 21/8(6 —1) 


This implies that, asymptotically for large a and small 0 > 1 (heavy-tailed case), ESS(L) 
can lead to a (much) larger risk capital than MS,(L); by b), this should not come as a 
surprise. 


Ex. p. 19 Solution 2.30 (Shortfall-to-quantile ratio for t) 


Since L ~ t,(u,o?), L = (L—p)/o ~ t, and thus L = u--o L. Therefore, Fr(x£) = Fg((x—u) 
so that solving Fr(x) = a with respect to x leads to VaRa(L) = u-- e VaRa(L) = u+ otz! 
see also Exercise 2.21. 


/0), 
( 


a); 


By linearity of the integral, this implies that 


1 1 1 m ~ 
ES4(L) = T VaR, (L) du = + o— i VaR,(Í)du— u--oESQ(L). (81) 


l-a Jo 


For computing ES,(L), let fẹ (£) = c,(14- a?[v)-"3. denote the density of tp, where c, = 


TUUS Substituting z = £;!(u) (so that dz = 1/ f, (t; !(u)) du and thus f, (x) dz = du) 
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leads to 


n= f ends LL. iet s ios 


s= (u) l— & Jt! (o) 


= vey (1 + a2/v)- > dx 


- | "Fm 
taal 2. EH] 
= — Él c, (14 of) tin] 
= TO Cs tapa) 


l-a 
fu (ty (0))0 6 (0)?/v). 


(122v) 


oo 


t; (o) 


1 v 


l-—av-l1 


Hence, by (81), ESa(L) = u + 6315 74 fe (t7 (a) + t1(@)?/v). 


For the ratio, we thus obtain that 


p ets zh fut (0))0 + t (0)?/v) 
TA. —— wu 

oa—1- VaRa(L) | eo1i- nc ot, (o) 

nc esc rcr t x7/v) 

z—t; (o) 1—00 U+ ox 


= lim ( H 4 Eu 


v—1 1 
St, (x) (1-22 /v) 
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we obtain that 


ES.(L) V (fir U+?) w- fi (£)(1+z2/v)-1 
lim i = lim x 
a—1-— VaRa(L) v — ] 2-00 — fa, (x) 
OY m OUN) = fuz) t 2?/r) 
Vy— 1 z—00 — fi, (x)a? 
Vus Cx Hf (n)? — fi, (a) + 2?/r) 
= V — 1 z—00 — ft, (x) x? 
E E" rig? + (1+2?/v) _ NP a | » 
(-fy (2) V — 1 2e zi S. tty 

MEE 
Tea 


This implies that, asymptotically for large a € (0,1) and small v > 1 (heavy-tailed case), 
ES4(L) can lead to an arbitrarily larger risk capital measurement than VaRQ(L). 

Note. As a limit for v — oo, we obtain that, asymptotically for large a € (0, 1), it does not 
matter which of the two risk measures is used to measure risk capital in the case of a normal 
distribution. See also Exercise 2.23 b) where we already showed that. 
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Solution 3.1 (Stylized facts) 
According to MFE (2015, Section 3.1) univariate stylized facts are: 


(æ 


( 
( 
( 
( 
( 
( 


U1) Return series are not iid although they show little serial correlation; 


Series of squared (or absolute) returns show profound serial correlation; 
Conditional expected returns are close to zero; 
Volatility, that is the (here: conditional) standard deviation, appears to vary over time; 


Extreme returns appear in clusters; and 


U6) Return series are leptokurtic or heavy-tailed (power-like tails). 


Solution 3.2 (Negative log-returns and the correlogram) 


a) 


e» 


c) 


'The detailed description of the construction and interpretation of the correlogram can be 
found in MFE (2015, Section 4.1.3). The bars show estimates of the estimated correlations 
between observations that are separated by a lag of h days (displayed on the x axis). By 
convention a bar of size 1 is usually displayed at lag h = 0. The dashed horizontal lines 
on the right-hand side of Figure E.3.1 show confidence intervals for the autocorrelation 
estimates under the hypothesis that the data are from a purely random, iid process. 

The correlogram in Figure E.3.1 shows little evidence of serial correlation. 3 of 30 
estimated correlations protrude beyond the confidence interval in a non-systematic way, 
which does not give strong evidence against the iid hypothesis. 


The plot of the negative log-returns mainly supports (U3) and (U4) of Exercise 3.1, to some 
degree (U5), (U6) and the first part of (U1). The correlogram supports the second part of 
(U1) of Exercise 3.1. 


If the boundaries of the region of rejection of the test for independence are not exceeded 
by the sample autocorrelations p,(h), h > 1, this does not mean that the data is iid (type 
II error). One often finds significant exceecances (thus rejecting the hypothesis of having 
iid data) when checking the correlograms of the squared values; see Figure 8.3.1. This is 
the stylized fact (U2). A plot of the squared negative log-returns and their correlogram 
typically also show the stylized facts (U4) and (U5). For example the plot of the squared 
values shows that extreme values tend to be followed by extreme values (not necessarily of 
the same sign), so (U5). The correlogram typically also supports (U5), when indicating 
some form of linear dependence between the squared values. 
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Squared -log-returns of the S&P 500 index 
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Figure S.3.1 Negative log-returns of the S&P 500 index from 3 January 2011 to 31 December 


2015 (left) and the corresponding correlogram (right). 


Note. A basic formal test of serial independence for stationary data is the Ljung-Box test. 
For more advanced tests of stationarity and serial independence, see, for example, Hofert, 
Kojadinovic, et al. (2018, Sections 6.2.1—6.2.2). 


Solution 3.3 (Interpreting Q-Q plots) 


a) 


The Q-Q plot against the hypothesized distribution (here: a fitted normal distribution) 
shows an (inverted) S-shape: Small empirical quantiles are smaller than the corresponding 
quantiles of the hypothesized distribution (see the x-axis values below —0.1) and large 
empirical quantiles are larger than the corresponding hypothesized quantiles (see the x-axis 
values above 0.1). In other words, such S-shaped Q-Q plots indicate that the sample exhibits 
heavier tails than the hypothesized distribution. Note also that the empirical quantiles are 
slightly larger than the hypothesized ones around —0.04 and slightly smaller around 0.04. 
'This means that the normal distribution puts more mass in these regions than the empirical 
distribution. 


The analysis of the Q-Q plot could be supplemented with a Jarque-Bera numerical test of 


normality. 


Any fitted normal distribution F with mean p and variance c? is a location-scale transform 


of a standard normal distribution ®. Since F^ (u) = u + o^ !(u), the quantiles are 
location-scaled, too. As such, only the intercept and slope of the line to compare against 
needs to be adjusted for a comparison of the sample and hypothesized quantiles. In standard 
software, this intercept and slope is estimated anyways; see the short source code of the 
function qqline() in the R package stats, for example. 
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Note. One should be aware of what procedure is used when creating a Q-Q plot with an 
existing piece of software. If the intercept and slope are estimated, then one actually only 
compares whether the sample comes from a type (see MFE (2015, Section A.1.1)) of the 
hypothesized distribution, so a location-scale transform of the hypothesized distribution. As 
the conclusion of a hypothesized distribution to be adequate — although only a location-scale 
transform is — can be dangerous, the function qq_plot() in the R package qrmtools allows 
for both versions (check against a location-scale transform of the hypothesized distribution 
by estimating intercept and slope of the line shown or exact check against the hypothesized 
distribution). 


Solution 3.4 (Multivariate stylized facts) 
According to MFE (2015, Section 3.2) multivariate stylized facts are: 


(M1) Multivariate return series show little evidence of cross-correlation, except for contempo- 


raneous returns (that is at the same time point); 


(M2) Multivariate series of absolute returns show profound cross-correlation; 


(M3) Correlations between contemporaneous returns vary over time (difficult to infer from 


empirical correlations due to estimation error in small samples); and 


(M4) Extreme returns in one series often coincide with extreme returns in several other series 


(for example tail dependence). 


Solution 3.5 (Joint negative log-returns and the cross-correlogram) 


a) 


A cross-correlogram displays estimates of the cross-correlation function (CCF) (along the 
y-axis) at multiple lags h (along the x-axis), that is, it displays estimates pxy,n(h) of 
pxy(h) = corr(X,,5, Yt) for each lag h in a finite subset of N; the estimates pxy.n(h) 
(also known as sample autocorrelations) are obtained from all available data pairs which 
are h units apart, so (X145, Y1), (Xo44, Yo),..., (Xt, Yin). Note that in contrast to the 
correlogram of a univariate time series, the cross-correlogram does not necessarily show 
1 at lag h = 0 since pxy,4(0) (similar to corr( X+, Y;)) is not necessarily equal to 1. The 
cross-correlogram in Figure E.3.3 overall shows little serial cross-correlation except for lag 
h = 0 (correlation between contemporaneous returns). 


Note. The plotted dashed horizontal lines indicate the boundaries of the region of rejection 
for a test whether the two time series are independent. It is important to note that these 
bounds are only valid if the two time series are white noise, so each being uncorrelated; see 
Brockwell and Davis (2002, Theorem 11.2.2) and thereafter. 

The plot of the joint negative log-returns supports to some degree (M4) of Exercise 3.4. 
The cross-correlogram supports (M1). 

Note. For the trained eye, a better plot to support (M4) is to consider the pseudo-observations 
of the negative log-returns instead, according to which each component sample is replaced by 
its ranks, divided by the sample size (typically plus 1); see MFE (2015, Section 7.5.1). The 
left-hand side of Figure 8.3.2 shows a plot of these pseudo-observations where a clustering 
of points in the bottom-left and top-right corners visually indicates (M4). 
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c) According to (M2) of Exercise 3.4, we expect the cross-correlogram of the squared negative 
log-returns to show profound cross-correlations. The right-hand side of Figure $.3.2 confirms 
this behavior and stylized fact (M2). 
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Figure S.3.2 Pseudo-observations of the negative log-returns (left) and cross-correlogram of 
the squared negative log-returns (right) of the S&P 500 and DAX index from 3 
January 2011 to 31 December 2015. 


Solution 3.6 (Longer-interval log-returns) 


a) The h-period log-return X = log(S,, 5/54) satisfies 


h h 
(h) _ (>+) u ( Suh Sun : u ( Su ) = 
X =lo =lo re = lo = Xt. 
ne 5 St i: Stth—1 St+h—2 St 2. $ Stu 2. is 


b) For u = E(X441) and o? = var(X;,1), the Central Limit Theorem implies that 


(h) 

x in — 

VR hE 4 N(0,1), 
o h—oo 


so that, for h sufficiently large, X is approximately N(hp, ho?) distributed. 
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Solution 3.7 (Manipulating logarithmic and relative returns) 


a) 


Solving Y, Y = = (Sip — St) /S_ with respect to Styn, we obtain Styn = (1 + YS; so 


St+h/ St = 1+ x and thus log(S;4,/S;) = log(1 + yey, The log-return x can thus 


be obtained from the simple return yí h) via A = log(1 + YO»), 


(h) 
t+h 


eo via X = — log(1 — YO», 
Solving x) = = log(St+h/St) with respect to Sth, we obtain Sti; = exp(X/7)) Si, so 
Sin — St = (exp X (9 )- 1)S; and thus (Styn — S4) /S; = exp(X()) — ]. The simple return 


t+h 
Y can thus be obtained from the log-return x via Y = exp(X(),) —1 


t 
( 


Note. The negative log-return X;, can thus be obtained from the negative simple return 


Y, 


Note. The negative simple return Ý, 
x0 (h) _ xo 


h) can thus be obtained from the negative log-return 


Xiph via i= (exp( Xith) ly 
Given log-returns bo MN ... and S;, we can construct S,,4, Stropn,... via Spin = 


) 


$, exp( X. x), Siok = Duk exp(X(!),) = S, exp( X ®, + X) etc. and thus Sikh = 


Stexp( E X kc 


Note. For negative log-returns as a s ..., the formula is $,, 4 = S; exp(— 2 1 a 
k 2 1. 
Given simple returns YD YT. ... and S, we can construct S55, $4425,... via Sepp, = 


SL HYR), Suas = Sua YA) = SQ + YAT) + Y.) ete. and thus Sia = 
STIEG + Y, AUNT k 21 


Note. For negative simple returns Y. Y ..., the formula is St, i, = St ma- F), 
k>1. 


Solution 3.8 (Experimenting with Q-Q plots) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 3.9 (Ljung-Box test) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 3.10 (Jarque-Bera test) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 3.11 (Skewness, kurtosis and higher moments of standard normal) 


a) 


Since X is a random variable with a density symmetric about 0 for which the kth moment 


133 


Ex. p. 22 


Ex. 


Ex. 


Ex. 


p. 23 
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exists, we have 


za) = f^ ak f(e)ar=(-0 f. Coro [^2 


EE miu SS eb | f2so°a* f(x) dz, kis even, 
=(-1 [v ronds f “sede D eats 


b) For even k, substituting y = x? (so x = gy and dx = 1/(2,/y) dy) leads to E(X*) = 
do y 70/26 y) dy, where ó(z) = exp(—2?/2)/4/2n, x € R, is the density of X ~ N(0, 1). 
Substituting z = y/2 we obtain for even k that 


h Qit(k-1)/2 poo ,. 
E(X^) === fv" /2 exp(—y/2) dy = "ERU z (FU? exp( —z) dz 
2(k+1)/2 /k+1 2/2 ede 
E 
Repeatedly applying the functional equation T(z + 1) = zT (z), we obtain that 
k/2 

kiir k-1,f/k-1Y o k-1k-83 1 r(1/2) 
'( 2 )- 2 r( 2 Jens z a eg ae Le (2j = 19. 
Furthermore, using the fact that [(1/2) = yr we obtain I'((k + 1)/2) = x5 IDA - 


(27 — 1)). Hence, for all k € N, 


E(X^) _ Ak — (2j — 1)), kis even, 
0, k is odd. 


c) We immediately obtain from b) that 8 = 0, x = 3 and E((X — p)®)/o® = 15. 


Solution 3.12 (Changing correlation over time) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial. org. 


Advanced 


Solution 3.13 (A simple version of the runs test) 


a) Every sequence of signs starts with a run. For t € {2,...,n}, a new run starts if Y; Z Y; 4. 
Since in this case (Y; — Nr) — 4, we have to divide by 4 to increment the count correctly. 
b) Under the assumption that P(Y, = —1) = P(Y, = 1) = 1/2, we have that Z = (Yi — 
Yi.i1)?/4 ~ B(1,1/2). Clearly, Z; and Zi- are independent for h > 1 under the null 
hypothesis. We can also check that P(Z, 1 = 2-1, Zt = z) = P(4-1 = %-1)P(Z = zi) 
for zi, zt-1 € {0,1}; for example P(Z;..4 = 0,2; = 0) = P(Yi-2 = Ya = Yj)) = PY = 
Yi- = Y = 0) + P(Yi-2 = Yı = Yi = 1) = 1/2 + 1/2? = 1/4 = (1/2)(1/2) = P(A = 
0)P(Z; = 0). Hence the Z;'s are independent and thus Rn — 1 = 357.9 Zi ~ B(n — 1, 1/2). 


— 
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Advanced 


If n is large, then, under Ho, Rn can be approximated by a normal random variable with 
mean E(R,) = (n + 1)/2 and var(R,) = (n — 1)/4. If a € (0,1) (for example a = 0.05) 
and the realized test statistic Rn falls below the (a/2)%-quantile or above the (1 — a/2)%- 
quantile of this normal distribution then we can reject the null hypothesis at the (1 — a)% 
confidence level in a two-sided (asymptotic) test. 


One could consider centered variables X; = X? — mo or X; = X2 — po, t € {1,... n}, 
where mo = med((X2),) and u2 = 1 7, X? denote the sample median and sample mean 
of X2,..., X2, respectively. If we anticipate clusters of small and clusters of large values of 
X1,..., X2 (so volatility clustering), then this would result in clusters of small and clusters 
of large values of X4,..., Xn and thus clusters of Y;,..., Y, of equal sign (when computed 


from Xi,... , Xn), so a large R, and therefore a higher probability of rejecting the null 
hypothesis of X2,..., X2 being iid. 


Solution 3.14 (Sensitivity of correlation estimates to volatility) 


R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 3.15 (Geometric spacings between largest values in iid samples) 


a) 


c) 


Using the iid property of (X;);en, we have that for any j € N, 


P(S; = k) = P(T} — Tj-1 = k) = P(T; = Tj-1- k) 
= PCXT, i41 < Uy... , XT; i+k-1 < u, Xr, itk > u) 
k—1 
= P(X; < u,..., Xp-1 < u, Xp > u) = P(X; > u) p x < u) 
i=1 


p = p), k € N, 


so Sj ~ Geo(p) on N for all j € N. We have left to check the independence property. To this 
end consider any n € N, distinct indices j1,...,j4 and natural numbers &k4,..., kn. Then 
the events {S;, = ki}, à € (1,..., n), are independent since (5;, = ki} only depends on X; 
for | € (T5, 1 + 1,..., Tj; 1 + ki = Tj,}, which are blocks of independent random variables 
(each block being determined by an i € (1,...,n]). 


We can construct a Q-Q plot. Let £1,...,£n denote realizations of X1,..., X, and let u 
be the empirical (1 — o)-quantile of z,...,c; for some a € (0,1). Let zi,..., riy, with 
{ti,...,tw,} € {1,...,n} and tı < +- < ty, denote the subsample of those xps which 
exceed the threshold u. Furthermore, let tj = 0 and s; = t; — tj;-1, j € {1,..., Nu}. The 
sample s1,...,5y,, denotes the spacings of the largest a% of z1,..., £n and the Q-Q plot 


can thus be obtained by plotting TU c0(0.05) (pi) sj): j= 1,..., Nu} for pj = j/(Nu + 1). 


If the iid hypothesis holds (so no volatility clustering or clustering of extremes is present in 
the data), the Q-Q plot should (roughly) lie on the line y = z. 


Let Fgeo(p) and Ffxp(à) denote the distribution functions of the Geo(p) distribution on N 
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Ex. p. 25 


Ex. p. 26 


3 Solutions to Empirical Properties of Financial Data 


and the Exp(A) distribution, respectively. If p € (0,1) is small, log(1 — p) ~ —p and thus 


Faeo(g(k) = Spa -= p)! -»Y E =p Den ed (1— p)* 
«e = 2 I-(-») 


i0 
—1-exp(klog(1— p)) & 1 — exp(—pk) = Ffxp(p) (k), FEN, 


so the distribution function of the Geo(p) distribution on N can be approximated by the 
distribution function of the Exp(p) distribution if p € (0, 1) is small. 

d) Figure S.3.3 shows a plot of F@eo(p)(k) together with FRypp)(k), k € (1,..., 200), (left) and 
a plot of the relative error |(Faeo(p) (k) — FExp(p)(k))/ Faeo(p)(K)]; k € (1,..., 200}, (right) 
for p € {0.01, 0.02, 0.04}. 


i: 


Faeo(p)(K) (solid) and Fexp(p)(k) (dashed) 
Relative error at k 


0 50 100 150 200 0 50 100 150 200 


Figure S.3.3 Foeo(p)(k) together with Fexp(p) (k), k € (1,...,200], (left) and a plot of the 
relative error |(FGeo(p)(k) — FExp(py(k))/ Faeo(p)(K)]; k € {1,---, 200}, (right) for 
p € {0.01, 0.02, 0.04}. 


Solution 3.16 (Log-returns implied by geometric Brownian motion) 


a) The h-period negative log-return x = — log(St+n/ S+) satisfies 


c NS Sua _ So exp((u — o?/2)(t + h) + oWisn) 
Xin = log( S, ) 7 les( So exp((u — o?/2)t + cWA) ) 
= -(((u — e? /2)(t-+ h) + oWegn) — (Cu — 62/2) + oW) 
= - ((u.— 0?/2)h + (Was — Wi) NC-(u — 0?/2)h, oh), 


where we used that Wii, — W; ~ N(0,t + h — t) = N(0, h). Therefore, h-period negative 
log-returns of stock prices modeled by a geometric Brownian motion are normally distributed. 
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Empirical properties of financial time series imply that negative log-returns of stock prices 
are typically leptokurtic, so have higher kurtosis than the normal distribution and thus 
more mass about the mean and in both tails. As such, geometric Brownian motion is not 
supported as stock price model. 

For h,k’ > 0, x = — log(St+hn/ S+) depends on the increment W;+n — W; and ad = 
—log(S¢+n)+n'/St+n) depends on the increment Wa+n)+a' — Wr. By the independence 
of non-overlapping increments (property 2)), the two risk-factor changes are independent. 
This is not supported by the stylized facts that volatility appears to vary over time and 


that extreme returns appear in clusters. 


137 


4 Solutions to Financial Time Series 


Review 


Solution 4.1 (Stationarity) 


a) 


(Xi)iez is covariance (or weakly) stationary if E(X?) < oo, t € Z, E(X;) does not depend on 
t and cov(X;, Xs) only depends on the lag h = |t — s|. (Xz)tez is strictly stationary if finite- 
dimensional distributions of the form (X;,,..., X;,) are invariant under time shifts, that is 
the distributions of (X;,,..., X1,) and (Xi, ..., Xi, 44) coincide for all t1,...,tn,k € Z 
and n € N. 


Weak stationarity only makes assumptions on the finite-dimensional marginal distributions 
of (X;)tez through the covariance (thus through a second-moment-type of condition), which 
is weaker than what strict stationarity assumes. For a concrete example, consider a series 
of independent random variables with mean zero and variance one such that odd indices 
correspond to variables from a different distribution than those for even indices. 

Strict stationarity on the other hand does not make the assumption of existing first two 
moments of each X;, which is weaker than what covariance stationarity assumes. For a 
concrete example, consider a series of iid random variables with infinite variance. 


Note. See also Exericse 4.4 b). 


The difference between (univariate) classical statistics (where we assume to observe real- 
izations of iid random variables (X;);ez) and (univariate) time series modelling (where 
we assume to observe realizations of a discrete-time stochastic process (X;)+ez) is that in 
the former setup we assume to have observed data from independent replications of the 
same experiment whereas in the latter setup we have only observed the outcome of a single 
experiment over time (observations at different time points are typically not independent of 
each other). Unless we have some form of stationarity, it is not possible to analyze time series 
in a meaningful way, think of observing realizations of X1,..., Xn, each random variable 
from a potentially different distribution (or even just different means, say); it is impossible 
to determine a distribution (or mean) based on a single observation. A covariance function 
at lag 1, for example, can only be estimated in a meaningful way based on observations 
from X4,..., Xn if (X1, X2), (Xo, X3)... (X41, Xn) have the same covariance. 


A linear trend can often be removed by differencing. For example if E(X;) = ct for some 
c € R, then the differenced time series V X, = X,— X, has mean E(V.X;) = ct—c(t—1) = c 
for all t, and thus a constant mean. Turning non-stationary data into stationary one can 
often be done with transformations of the data (which then need to be undone after modeling 
of the transformed data took place). 
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Ex. p. 27 


Ex. p. 29 


Ex. p. 29 


4 Solutions to Financial Time Series 


Note. An interesting result concerning covariance-stationary processes is Wold’s decomposi- 
tion which says that any such process allows for a decomposition into a sum of two time 
series, one deterministic and one stochastic. 


Solution 4.2 (Identification of ARMA processes) 


= Row 1: The ACF cuts off after lag 1, the PACF decays exponentially. This hints at an 
MA(1) process, so Model d). 


= Row 2: As this path shows an increasing trend (and also seasonality), this time series is not 
stationary. It thus corresponds to Model b). 


= Row 3: The ACF decays exponentially, the PACF cuts off after the second lag. This hints 
at an AR(2) process, so Model c). 


" Rows 4 and 5: Although based on simulated data, the paths in the fourth and fifth row are 
non-trivial to distinguish. For example both ACFs seem to cut off after the second lag. The 
PACF of the time series in the fourth row seems to decay exponentially, which suggests the 
MA(2) model, so Model a) and thus the model in the fifth row is Model e). 


Note. 'The process in the fourth row was indeed simulated from Model a) and the one in the 
fifth row from Model e). We see that even for simulated data, there might not be a clear-cut 
boundary between different models. 


Solution 4.3 (GARCH models and stylized facts) 


a) The stylized fact of volatility clustering can be captured by GARCH processes. In GARCH 
processes, the conditional variance depends on past values of the squared process and past 
values of the conditional variance itself. If at least on of the two is large, the conditional 
variance for the next unit step in time is also large. 


b) The variance is 2/(1 — 0.1 — 0.85) — 40. 


c) The ACF of the squared process would follow a very similar form to that of an ARMA(1, 1) 
process with AR parameter $4 = a; + 61 = 0.95 and MA parameter 04 = —6ı = —0.85. 


Solution 4.4 (White noise) 


a) A strict white noise is an iid process whereas a white noise is an uncorrelated covariance- 
stationary process. 


b) Independent and identically distributed Cauchy variables provide an example of a SWN that 
is not WN. The process described in the solution of Exercise 4.1 b) provides an example of 
a WN that is not SWN. 


Solution 4.5 (AR models with ARCH errors) 
a) The equations are 
Xt = tt ti, 


ht = p Oi(Xt-1 — p) 
of = ao + a1 (Xa — pe-1)”. 
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b) The conditional 6-quantile of X,,4 (value-at-risk) is given by 
VaRg = pta + o+4190(Z), 


where Z is a generic variable with the innovation distribution Fz. We obtain the formula 


VaRg = u + éY(X« — u) + V ao + a (Xt — p — $1(Xt-1 — u))?ae( Z). 


Solution 4.6 (QML estimation of GARCH models) 

In QML, the likelihood function for the case of Gaussian innovations is maximized (regardless 
of the particular innovation distribution). Standard errors for parameter estimates have to be 
calculated using an asymptotic covariance matrix estimator of sandwich form. QML tends to be 
used when a modeller is unsure about the correct innovation distribution and more concerned 
with getting plausible estimates of the dynamics of the model. 

The same parameter estimates are obtained as for full maximum likelihood with an assumption 
of Gaussian innovations, since the same likelihood is mazimized. However, different standard 
errors are typically obtained for the ML and QML methods due to the use of the sandwich 
estimator in the QML case. 


Basic 


Solution 4.7 (Checking stationarity) 


a) (Xi)tez satisfies E(X?) = o2 < oo and has a constant mean function equal to 0, but 
since cov(Xi41, Xt) = E(XiiX;) = Eleeyiypiee) = cov(Et+2,€t) = 0 if t is odd and 
cov(X¢41, Xt) = E(Xi,1Xi) = Eleesietsi) = var(er41) = o2 Æ 0 if t is even, (Xi)iez is not 
(covariance) stationary. 


b) (Xz)teno satisfies E(X?) = hp E(c2) = to? < oo and has a constant mean function equal 
to 0, but since var( X+) = E(X?) = to? is not constant over time, (X;)ten, is not (covariance) 
stationary. 


Note. If (€t)+ez is a strict white noise (iid), then (X;);ew, is a random walk. 


c) First, the second moment of (X;);ez is finite since E(Z7) = 0% < oo, j € {1,2}, implies 
that E(.X2) < oo, t € Z; multiply out to see this. Second, the mean function u of (X;);ez is 
p(t) = E(X) = sin(4t) E(Z2) + cos( 5t) E(Z1) = 0, t € Z, which is constant in t. Third, if 
a = (1/6)t and b = (x/6)(t +h), then the covariance function y of (Xt)rez is 

*(h) = cov(Xi+r, Xe) = E((Xirn — EXt¢n)(Xt — EX)) = EXC X) 
= E((sin(b)Z2 + cos(b) Zi) (sin(a) Za + cos(a) Zi)) 
= E(sin(a) sin(b) Z2 + sin(a) cos(b) Z1 Z2 + cos(a) sin(b) Zi Z2 + cos(a) cos(b) Z2) 


sin(a) sin(b)o? + cos(a) cos(b)a% = o2, cos(a — b) = o2 cos( Eh), tEZ, heZ, 


where we used an angle addition formula. We see that y only depends on the lag h, not 
on t. Hence (.Xi)iez is (covariance) stationary with mean function u(t) = 0, t € Z, and 
autocorrelation function p(h) = 7(h)/y(0) = cos(§h), h € Z. 
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Ex. p. 29 


4 Solutions to Financial Time Series 


Ex. p. 29 Solution 4.8 (Symmetry of ACF) 
Since, under covariance stationarity, y(—h,0) = y(—h + h, 0 + h) = 7(0,h), h € Z, we obtain 
E —h,0 0,h h,0 h 
ai ipd ) xCh0) _ 10A) «040) _ rh) 


WO ^ x90 - 39 - 39 (0) ^ ^en 


Ex.p.29 Solution 4.9 (MA(2) process) 
a) X, = Po Vk£t-k for Vo = 1, V1 = 01, Y2 = 0» and vy = 0 for all k > 3. Clearly, 
Ro [ee] = 1+ 01 +02 < co, so (Xi)iez is causal. For stationarity, we check the following: 
= The second moment of (X;)iez is finite since E(X?) = E((e¢ + 0161.1 + 0561 3)?) = 
c2 + 6202 + 0202 = (1+ 62 + 62)o2 < oo, t € Z. 
= The mean function u of (X;)iez is u(t) = E( Xt) = E(&t + 01611 + 0261-2) = E(ex) + 
O,E(e¢-1) + 09E(&4 2) = 0, t € Z, which is constant in t. 


= The covariance function y of (.X;)iez is 


y(h) = cov(Xt4n, Xt) = E(Xcon Xt) 
= E(ertn Xt) + O1E(er4n_1Xt) + 02E(8145 2 X1) 


B(e?) + 81E(e1 1) + 63E(e2.3) = (1 + 01 + 63)o2, h=0, 
J 0+ &4E(e2) + 05 - O,E(e?_1) = 01(1 + 02) 02, h=1, 
B 0 4- 0 + 03IE (£2) = 6202, h=2, 
0, h>3 


Since, by definition, y(—h) = y(h), h € Z, the autocovariance function 7(h) indeed only 
depends on h € Z and thus (X;);ez is stationary. The corresponding autocorrelation 
function is given by 


1, h=0, 
01(14-05) I 

h E 1:92:01: [h| = 1, 

p(h) = 05 Ih| — 2 

crepe h-2 

0, |h| > 3. 


p thus correctly cuts off after the second lag. 
b) Since P, X,,4 = E(Xtin | Fe), h 2 1, where F; = o({X; : s € t}), we obtain that 


P,Xq1441 = E(XQ4 | Fe) = E(&4- 018 + 09613 | F1) = 018: + 02811, 
PLXq45 = E(&42 + 01641 + 0261 | Ft) = Ozet. 


Since X,,; only depends on innovations e; with index k > t+ h — 2 which is greater than t 
for all h > 3, independence of such innovations from F; leads to P, X4, = E( Xia | Ft) = 
E( X44) = 0 for all h > 3. 
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Solution 4.10 (Absolute summability condition) 
By the Monotone Convergence Theorem (MON), the Cauchy-Schwarz inequality (CSI) and 
stationarity, we have 


n 


( y» il) = z( dim, 28772 )N- Mon m. ( » [e 


k-—-—oo k-——oo k-——oo 


= lim SO [del (ere) ES Jim D |o | Eleg) 
k=—0o 
= y E(e i) lim. » |Vx| = VE 2 y |óx| =o: M. [Yr] < oo, 
k-——oo ) 2o. k-——oo 


which implies that X>% ,, YkEt—-k converges absolute with probability one. 


Note. See Brockwell and Davis (2002, Proposition 3.1.1) for a more general result. 


Solution 4.11 (AR(1) process) 


a) 


x (Yi)tez be idu 2 solution of (E1) and consider Z; = X, — Y;, t € Z. Then 

— h1 Xi-1 = & = E t € Z, implies that Z: — $1Z,-1 = 0, t € Z, and so, for all 
n e N, Zi = 014-1 = = $? Zi s, t € Z. Therefore, for all n € N, the Cauchy-Schwarz 
inequality and PE E of (Zt)iez implies that 


0 < E(|Zil) = li E(Zea < le VEL = VEGDIAS tez 


By stationarity of (Z;)iez, E(Z2) < oo and thus the right-hand side converges to 0 for 
n — oo for every t € Z. As the left-hand side does not depend on n, we obtain that 
E(| Z|) = 0, t € Z, and thus that, almost surely, Z; = 0, t € Z. 

Since (Y;);ez is of the form as in Exercise 4.10 for Y = 0 for k < 0 and vy = ¢ for 
k > 0, the claim follows from Exercise 4.10 by noting that 359. 4, |vx| = Yo [ei^ = 
1/(1 — |i) < oo 

Since Y; = YX o olei 4, t € Z, 


— Yı = X (ferr) - Q1 » $ieici-k = Y (per) p? $1 et (ua) 
k=0 = k=0 


oo oo 
k 
= X (ofeg) ~ piei) = pleo =E t€Z. 
k=0 k=1 


By a), we thus know that the unique stationary solution (X;)iez of (E1) is X, = Y; = 
Y 25.0 Ferk, t € Z, almost surely. 


Note. This series representation of (X;),ez justifies the use of the backshift operator B 
defined by BX; = X1: Xi = $1D Xi + Et, SO (1 = $1B)Xi = Ef thus Xi = e (1 = $B) = 
Drool B)"e = reo Of et—k- 
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4 Solutions to Financial Time Series 


d) As (Xi)iez is stationary, we have 0 = E(e;) = E(X;)—@1 E(X¢_1) = (1—93)E(X1) = (1-¢1) 


for u = E( X+), and thus u(t) = u = 0 for all t € Z. 
The autocovariance function y must satisfy 


y(h) = cov(Xt4n, Xt) = E(Xt4n, Xt) = E( (91 Xt+h-1 + Et+n) Xt) 
= bE(Xt4n—1Xt) + E(erpn Xt) = )15(h 1) +0 =- = 4f7(0), REN. 


Dividing by (0) and using symmetry, we obtain that the autocorrelation function p of 
(Xi)tez is p(h) = ol", h € Z. So the autocorrelation function of an AR(1) process decays 
exponentially as a function of the lag h. 

Note. 


1) Based on X, = EZ o Vei for Yk = Qf, t € Z, we can also obtain the autocorrelation 
function from MFE (2015, Proposition 4.9) via 


— MER eu Eoo am h 
pth) = oo coy gm c5 he 
M-o VE ko 01 


2) The exact form of the autocovariance function is obtained by computing y(0) = E(X2) = 

E(($1Xi-1e))?) = (1TE(QX?..,) -201E (X16) -E(e?) = é1E(QXZ) 07-02 = gf) 02, 
so that (0) = 02/(1 — $2) and thus 7(h) = olo? /(1 — $2), h € Z. 

Let F; = o({X,:5 € t]). Using 


Xi = Xii + et = O1(G1 Xe_o t+ €i) + Et = P Xio + hei te 
n—1 
= óiXi 3 + diea + beri + Et = = OP Xin + 5 Rein, n cN, 
k=0 


and thus that X,,5 = OLX (tt h)—h + ee OLE (t+h)—ko h € N, we obtain that 


A) 
h—1 


= 9E(X |.) + M  efE(ec | Fr) = of Xt, LEN, 
k=0 


h—1 
P; Xin = E(Xt4n | Fe) = E (ox: T 5 PFE (t4h)—k 
k=0 


where we used that £, is independent of F, for all s > t. 


Ex. p. 30 Solution 4.12 (ARMA(1,1) process) 
a) (Xz)tez is an ARMA(1,1) process with corresponding polynomials olz) = 1 — ġız (for 


the AR part) and 6(z) = 1 + 62 (for the MA part) which have no common roots unless 
Q1 4- 01 = 0. Therefore, ( X;)iez is stationary and causal if and only if ¢; Z —@ and ¢ has 
no roots in the unit disk D = {z € C : |z| € 1}, so |d1| < 1. 
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b) As (Xi)iez is stationary, we have 0 = E(e;) + 61 = 1) = E(Xi) — Q E(Xi-1) = (1 - 
$1)E(X;) = (1 — $1)u for u = IE(X;), and thus u(t) = u = 0 for all t € Z since |à1| < 1. 
The coefficients (vj)iew, of the representation X, = ue o YkEt—-k Of the stationary and 
causal (.X1):ez as a linear process can be found by considering 


2 v2 à(z)' E > 4, 


see MFE (2015, p. 103). Since 


O(z) 1lctÓz c k k kl 
Uc dew = (1 + 042) 2, (52) -M $iz)" PX 


=14 Dah Y asit ore Yen ore Ya (ó1 + 61)z5, 
k=1 


we obtain that yo = 1 and vy = $t 1(¢1 + 41), k € N. Since 229 |V| < oo, MFE (2015, 
Proposition 4.9) implies that the autocorrelation function p(h) of (X4),ez for h Æ 0 is 
ko VED ELA _ Ynt ($i FA? Oe. er n NS 

Deo Ve 1+ (d1 + A1)2 DZ, Ge 

(br + 1) + Ol (9) + 2D OO? 

1+ (61 1)?/(1 — e) 

“(or + 6) = 9D ot (és +01)? 

1— {+ (¢1 + 061? 

of hi + 1) = bó +1) _ o (6 6)0 + gib), 


mE =] ’ 


1+ 2616; + 6? 1+ 26461 + 6? 
for h = 0, clearly p(h) = 1. 
c) In this case X; = € almost surely for all t. 
Solution 4.13 (Kurtosis of GARCH(1,1) process) 
a) Let Y; = Ii- (a122; + £1), i € N. It follows from MFE (2015, equation (4.27)) that 


xt =2}(a0(1+ 3° [lat +0))) =h) 


i=1 j=l i=l 


2103 25392 5 


41—14152—1 


p(h) — 


Note that E(Z#) = Kz < oo, so E( X7) < oo if and only if the expectation of the last factor 
in the displayed equation above exists. First, 


(rx) = Y` IDE? 8) = Y [Té £0 = Y (o + Bt. 
i=1 i=1 j=1 i=1 j=1 i=1 
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4 Solutions to Financial Time Series 


This sum is finite as long as o + £1 < 1 (recall that o4 > 0, £1 > 0) which holds for any 
covariance-stationary GARCH(1, 1); see MFE (2015, Proposition 4.21). Second, 


(( s x: YY) zs (( il il (aZe 5, + 1) (e Zz j, *&)) 


i=l i2=1 ij=lig=l `ji=l jo=l 

oo oo i1^d2 i1Vi2 

= >) 2( [[ (uZ7_;+ 4)? I] (-1Z2 4 i) 
ti- issi j=l k=i1Ato4+1 
oo oo à1^i2 i1 Viz 

=X )_ [|[ E% +6) [[ Elaz? + 41) 
i=l i2=1 j=1 k=11 Aig4+1 
oo oo 

= 5 X E((a1 Z2 + 81)? ^? (ay + &i Mia iia 
tie dls 


where x ^ y = min{z, y} and x V y = max{x, y}. We thus see that E(.X7) < oo if and only 
E((a1Z? + 61)?) < 1. Since 


1 


rH 


E((a1Z? + 81)?) = afE(Z/) + 2018, E(Z?) + 82 = od&z + 20181 + 6? 
= OlKZ - o? +(a,+ A? = (Kz — 1)o2 + (a1 + AY, 


the claim readily follows. 
b) By MFE (2015, Section 4.2.2), 


= &z(1— (a1 + 61)*) 
1— (o1 + 61)? — (kz — 1)ot 


KX 


If kx = Kz, then both are equal to 1. This is a very light-tailed process and we would be 
unlikely to encounter it in practice. Generally we work with models where the GARCH 
mechanism inflates the kurtosis of the innovations. 


Ex. p. 31 Solution 4.14 (GARCH(2,1) and GARCH(1,2) processes) 
a) Recall that a covariance-stationary GARCH(2,1) process has the form X; = o1Z4 for 


of = ao +a X71 +a2X7o+fioz,, t€Z, (S1) 


where (Zi)tez ~ SWN(0, 1) and ao > 0, o4 2 0, a9 2 0, 81 > 0 with o4 + o» + f1 < 1; see 
MFE (2015, Section 4.2.2). Since o44; and Zt+n are independent and Z,,; is independent 
of F, for all h > 1, we obtain that 


PX = E(X en |F) = R(Of sn | Ft) EZ n |F) = E (07.1 | Ft) E(Z?,4) = E (07,1 | Fz). 


In particular, 


E(X in| Fe) = Eloi Fe) hz 1. (S2) 
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We obtain from ($4) and (S2) that 


PX? n = E (07, 1, | Ft) 


ao + 01X2 + a3 X2 + £192, h=1; 
= 4 ao + E(X? | Ft) + 2X? + Bi E(o7,1| Fe), = 2; 
ao + E(X? 4 4 | F1)  o9EQX2, 4 5| Fe) + HE (ofp nal Fe), hz 3; 
ao + 01X2 + a2X? + £192, k=l; 
= { ao + (03 + f1) E(02,4 |) + o3 X2, EE 


ao + (ay + £1) Elo ni | Fi) + a2 E(02, 4. |F), R283. 


Letting 71 = o1 + £1 and sip = Elo? in | Fz), h > 1, we obtain that 


ao + a1 X? +a2X2 +6102, h=1; 
Stra = 4 a + 18t41 + 02 X2, h = 2; 
Qo + V1St+h—1 + Q2St+h-2, h > 3; 

ag + o4 X? + a2 X21 + iot, h=1; 

= 4 ao(1 +71) + (ary +a2)X? + apy X2; + 61102, h= 2; 

Qo + V1St+h—1 + Q2St+h—2, h > 3. 


Since the terms s;41 and 5:49 are known, we thus have a recursion to compute Pex? h = SERS 
h>1. 

Note. One can also derive an explicit formula for s;,5, h > 3, in terms of s41 and st+2. To 
solve the second-order inhomogeneous difference equation 


Stth — Y1St+h—1 — O28t45-2 = Q0, for given 5441, St+2 (S3) 


we use the fact that s; is a solution if and only if s; is the sum of a particular solution and the 
solution of the second-order homogeneous difference equation St+h—Y1St+h-1—Q2St+h—2 = 0. 
Note that the constant s = ag/(1 — yı — o) solves (S3) (see also b)). To find a solution of 
the homogeneous difference equation, note that 


dI ELI Ei fr A= n a| 
St+h—1 St+h—2 Sty 1 0 


Computing the roots of the characteristic polynomial y4 (A) = det(AI3 — A) = M —41A — aa 
(where Ip € R2*? denotes the identity matrix) leads to the eigenvalues A15 = (Ga + 


v2 - 4a2)/2 of A. Solving (A — À1,512)z = 0 leads to the eigenvectors (à) and oe We 
thus have that 


HG 


Ah^-? 2 (EAP e? = TA? -2p-1 
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for 


BU Ag 0 tr. dd ji 25 
se (* 9). a= (8 2) aot me Lt. 


Plugging in and multiplying out leads to 


Stth | _ Ah-2 [88652 | _ 1 A = Am Ap — ARTIA S149 
St+h—-1 St+1 MA (APT? — ART MABT? — ARA) Asti 
from which we obtain 


NARE QT = AZ )sus - AXSOT? — AT sta 
tth eG 


for q = (M, — A§)/O1 — Ag) = DEn APA 175, | 1. Adding the particular solution, we 
obtain the explicit formula 


= Ch—181t4-2 — Ch—25t4+1 


L oyl 
a is u | 2 1. 
1-a — fı — Q2 At — A2 


Stth = S + Ch_18t42 — Ch_-2St41, for s = 


We use the notation as in a). If the limit s = limp_,. $447 exists, it must satisfy the recursion 
derived in a). Plugging in, we obtain s = ao + %18 + ags and thus s = ag/(1 — y1 — a2) = 
ag/(1 — a, — £1 — o2) as long as o4 + £4 + o» #1. Since s must be non-negative, we thus 
obtain the condition o4 + 6; + a2 < 1 which is also the covariance-stationary condition 
according to MFE (2015, Section 4.2.2). 


Note. We can also use the explicit formula derived in a) to obtain the covariance-stationary 
condition. We see from the explicit formula that it converges (and indeed against s = 
ag/(1—a1— £1 —02)) as long as |À1,5| < 1. Trivially, 4; > 0 > —1 and A, < 1 happens if and 
only if 4/42 + 4a2 < 2 — yı which in turn happens if and only if yı 4- o3 < 1 or, equivalently, 
ay + £1 + ag < 1. Under this condition also |A2| < 1 since, trivially, A € 71/2 € 1/2 and 
az < 1—71 implies that A» > (m- y — Ay +4) /2 = (y -|"1—2|)/2 = (OUu- (275)/2 = 
ON = 1=-1. 

Recall that a covariance-stationary GARCH(1,2) process has the form X, = oZ; for 


c2 = a + 01X2 4 + f102 4 + B2029, te Z, (S4) 


where (Zt)iez ~ SWN(0,1) and ao > 0, o1 > 0, 81 > 0, £5 > 0 with o4 + £1 + 82 < 1; see 
MFE (2015, Section 4.2.2). Note that (S2) holds as in a). With the notation as in a), we 
obtain 


ao + o1 X? + Bio? + b202, hi 
PX? p = St+h = § Q0 + V1St41 + B202, h = 2; 
Qo + Y1St+h—1 + b2St+h-2, h > 3. 


We thus obtain the same recursion as in a) with a replaced by £5 and thus limp_+.. P,X? "n 
IS OL as well as the covariance-stationary condition o + £4 + 69 < 1. 
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Solution 4.15 (Skewed innovation distributions) Ex. p. 31 
a) ô= 1 implies fy(x) = fx(x), x € R, so no skewness in comparison to the original density 

fx. For 6 > 1, x values to the left of 0 are scaled further out to dx and thus obtain less 

probability mass and x values to the right of 0 are scaled further in to 6~!x and thus obtain 

more probability mass. This introduces positive skewness or right-skewness (fy has a longer 

right tail in comparison to fx). A ô € (0,1) reverses the effect and introduces negative 

skewness or left-skewness (fy has a longer left tail in comparison to fx). 
b) Clearly, fy (x) > 0 for all z € R, 6 > 0, and fy is integrable. Substituting once y = dz and 

once y = ó-!z, as well as using symmetry of fx about 0, leads to 


ie fy(z)dz= s. fx (de) de [" ae az) 
i ng GEN fx(y) dyes fx(y) ay) 


2 ôT! 2 d$ 46 
"iar EC )dyt 5 d. Ix(y ay) = ig pu 0c c: 


By the same substitutions and the fact that Fx(0) = 1/2, we have 


2 x < 
Fy (x) ET ie Joe fx (dz) dz, : -i as 0, 
gius fx(6z) dz+ fo fx(d™ z) dz), r0, 
eee E E a « 0, 
5:3 (4 A9. fx(y)dy +6 d^ fx(y)dy), c» 0, 
_ a x <0, 
spat (2p + 6(Fx(6-1a) - 3) 2 >0, 
_ rig Fx (2), 2 « 0, 
+ F(s), x0. 


c) Substituting once x = dy and once x = 0^ 1y similarly as in b), as well as using symmetry 
of fx about 0, we obtain that 
2 uM i <i 
yal f xmas f yf dy 


0 oo 
= - (aor | z^ fx (x) dz + 6**1 h a^ fx (a) de) 
—oo 0 


= F(T [fetter + EE (7 ete az) 


Okt 4 —1)k§-(k+1) 
= OC Bx) 


for all k € N for which E(|X|*) < oo 


gY^) = 
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p. 


p. 


p. 


p. 


32 


32 


32 


32 
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d) Using c), the mean and variance of Y are given by 


627 — 6? 
~ 646-1 


(|X|) = (6 - 8 E(X), (S1) 


09467 
EET 


py — E(Y) 


of = var(Y) = E(Y?) - E(Y)? 


EX?) - (6-9 )EQXDY, — (82) 


respectively. Therefore, Z = (Y — uy)/oy has mean zero and unit variance. If X ~ N(0,1) 
then 


(xp =f isle) de = -2 [^ nox) dx = m [exp(—2"/2)] 


oo 2 2 
0 


where (x) = exp(—2?/2)/4/2n, x € R. We thus have that 


D (8 5-1 2 3 _ 8+0 ( aW 
uy = (6-6 WE and Be ET (6—8 7) z^ 


e) If X — t, and f(x) = c,(1 +22/v) F, x E€ R, for c& = ee, we have 


E(|X |) 


vu 2vocy 
0 


E etmas [^ f as ve [- L1 ez) 


1 
_ We T(v- 1/2) 
v—1 Vnl(v/2) ` 


With E(X?) = var X = z£5 if v > 2, one can compute py and of from (S1) and (82), 
respectively. 


Solution 4.16 (Fitting GARCH models to equity index return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial. org. 


Solution 4.17 (Fitting GARCH models to weekly equity index return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial. org. 


Solution 4.18 (Fitting GARCH models to foreign-exchange return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial. org. 


Advanced 


Solution 4.19 (AR(2) process) 


a) (Xi)iez is an ARMA(2,0) process with corresponding polynomials olz) —1- $12 — daz? 
(for the AR part) and 6(z) — 1 (for the MA part) which have no common roots. Therefore, 
(X;)iez is stationary and causal if and only if ¢ has no roots in the unit disk D = {z € 
C :|z| € 1}. Note that ó(z) = 1 — $1z — $222 = (1 — Ziz)(1 — Zoz) with % + % = $4 and 


£129 = —$». Furthermore, $(z) = 0 if and only if Z; = 1/z for at least one i € (1,2). 


150 


= 


Advanced 


If ó(z) £0 for all z € D, then % Z 1/z for all z € D and both i. So |Z;| < 1 for both i 
and thus |¢2| = |2122| = |Z1| - |£2| < 1 (so i) must hold). Moreover, since (0) = 1 > 0 and 
b(z) £0 for all z € D, we have |ó(2)| > 0 for all |z| € 1, in particular ¢(41) > 0. Therefore, 
1— $1 — $2 = G(1) > 0 (so ii) must hold) and 1+ $1 — ¢2 = ¢(—1) > 0 (so iii) must hold). 
Note. Sufficiency can be shown by showing that under the given conditions, |Z;| < 1, 
i € {1,2}. Since |Z3Z9| = |ó9| < 1, |Zi| < 1 or |Zo| < 1, without loss of generality assume 
|Zi| < 1. First consider the case that 2; € R, so -1 < Zj < 1 or 1 +3 » 0. Then Z must 
be real since 24 + Z2 = $4 € R. The assumptions imply that 1 > $2 + $1 = 4 + 29 — £129, 
so 0 > 2; 4+ 2 — Z0129 — 1 = (1 ZA) 22) or (1 cs 21 = 29) >0. As1-—-ã > 0, we 
obtain that 1 — Z > 0 or Z < 1. Similarly, 1 > $9 — $1 = —% — 22 — £129 implies that 
(1+ %)(1+ Z9) > 0 which leads to 1+ Z2 > 0 or Z2 > —1. Therefore, |Z| < 1. Now consider 
the case that 21 ¢ IR. Then % 22 = —$» € R implies that 29 = 21, so |Z2| = |Z| < 1. 

The set forms a triangle with corners (1, 62) € ((—2, —1), (0, 1), (2, 21). 

As (X;z)tez is stationary, we have 0 = E(e,) = E(X;) — $44E(X4 1) — 992E(X4 3) = (1 — $4— 
$3)E(X1) = (1— $1 — à2)u for u = E( Xi), and thus that u(t) = u = 0 for all t € Z since 
1— $1 — $2 #0. 


The autocovariance function y must satisfy 


y(h) = cov(Xt+n, Xt) = E(Xi4r Xt) = E((O1Xt+n—1 + b2Xt+n—2 + €) Xt) 
= Pi E(Xt4n—1Xt) + boE(Xt4n-2Xt) + Eleth Xe) = G1y(h — 1) + day(h—- 2), REN. 


Dividing by 7(0) and using symmetry, we obtain that the autocorrelation function p of 
(Xt)tez must satisfy 


p(h) = dip(h — 1) + ó»p(h —2), hEZ. 


Clearly, p(0) = 1. Choosing h = 1 in the recursion and noting that p(—1) = p(1) (by 
stationarity), we obtain p(1) = ¢1-1+4+ $2p(1) and thus p(1) = ¢1/(1 — $2). This allows one 
to compute p(h) iteratively for all h € Z. 


Solution 4.20 (EGARCH(1,1) model) 


a) 


Let g(Zi-1) = a1 Zt-1 + V1 (|Zt-1| — E(|Z:-1])), so that 


log(o2) = ag + g(Zi-1) + Ki log(o2 4), t c Z. 


If Z:-1 > 0, then g(Z:_-1) = (o1 + 53)Zi-à — V1 E(|Z_1]|) which is linear with slope o4 + 71. 
If 2:1 < 0, then g(Z:-1) = (a1 — 1) 2-1 — V1 E(|Z_1|) which is linear with slope a, — 71. 
Through suitable choices of o4, 71, the function g thus allows the conditional variance 
to change with (Z)zez and thus (X;)iez. For example if o; = 0 and 7 = 1, then large 
absolute innovations |Z; 1| > E(|Z, 1|) increase the conditional variance and small absolute 
innovations |Z;—1| < E(|Zi-1|) decrease the conditional variance. 


Because £; only depends on Zi; 1, (&t)tez is a sequence of iid random variables and thus a 
strict white noise (SWN). Furthermore, E(e+) = o4lE(Zi 1) + n (IE(| Zi 1|) — E(|Zi-1])) = 0, 
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t € Z, and 


var(e;) = a? var( Zi 4) + 32 var(|Zi i| — E(|Z:-1|)) + 20351 cov( Zi, | Ziel) 


Since var(Z;_1) = 1, var(|Zi-1| — E(|Z¢-1|)) = var(|Z+-1|) = E(Z2 4) — E(|Z_1|)? and, by 
the assumption of symmetry, cov(Zi 1, |Z:-1|) = E(Z:-1|Z_1|) = 0, we obtain that 


var(e¢) = af + V (E(Zj_1) - E(IZea?) = af +71 -EIZ tez. 


Since Y; = ao + 61¥:-1 + €t, t € Z, and, by b), (€t)tez is a strict white noise, this is an 
AR(1) process; if |81| < 1, it is strictly stationary. 


Under covariance stationarity, we must have wy = IE(Y;) = ao + 8&JE(Y; 1) = oo + bity 
and thus uy = ag/(1 — £81) as long as £1 Z 1. As (£i)iez is a strict white noise, we only 
need to guarantee that the variance of Y, exists and does not depend on time. Since 
c2. = var(Y;) = var(ao + b1Yi—1 + £t) = 62 var(Yi 1) + var(et) = 8202. + var(et), we must 
have 


o2 = Sars) | +111. EZ?) 

2 = = 
Le 1- BF 

as long as |81| < 1. So (Yi)iez is covariance stationary if |61| < 1 with mean py and 

variance es 


Note. For a standard normal distribution, one has E(|Z;|) = \/2/r. 


Ex.p.33 Solution 4.21 (GJR-GARCH(1,1) model) 


a) 


b) 


The effect on volatility depends on the sign of X41. If y1 > 0, then the conditional variance 
is larger if X; 4 < 0 than if X4, > 0. If 41 < 0, the effect is reversed. 

Assume (X;)tez is covariance stationary. Clearly, we have E(.X;) = E(o,Z;) = E(o4)E(Z;) = 
0, t € Z. Furthermore, 


E(o?) = E(X?) = E(o7)E(Z?) = E(o?) 
= ao + o4EE(X2 1) + WE ¢x,_, <0} Xt) + PiE(o7_1). 


Because of the stationarity requirement, we must have E(X? 4) = E(X?) = E(o?) and 
E(c? 4) = E(o?). Also, E(x, 1 c0) X1) = E(Itz, 1-0) 27-1971) =E Iiz, 1«0) 22.1) E(o?). 


We thus obtain that 


EloF) = ao + a1 E(o7) + 5h E(Itz, coy Z?)E(o7) + Bi E(o7) 
= ao (o1 + v E(I5, <0} Z2) + £1)E(o7). 


Solving for E(c2) we obtain that 


ao 
1 — (o1 + 'E(Itz, <0} Z2) + 61)’ 


var(X4) = E( X2) =E) = 


which is positive and finite if o4 + £1 + %1 E(Itz, 20 Z2) « 1. 
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c) If Z is symmetrically distributed about 0, then 


0 1 sex 
Bue Zt) =f fadi] Simo 


the stationarity condition is thus aj + £1 + 54/2 < 1. 


Advanced 


- 1 
E(Z7) = 53 


N| = 


d) For h = 1, the squared volatility recursion reads 07,1 = ao + (a1 + V11¢x,<0}) X? + 6197, 


which is known at time t. For h > 2, let 07 = 


2 . . 
of of, is given by 


E (07, | Ft) 


= ao + 01 E(X? n1 | Fe) 


E(I;z,<0}Z?). Then the optimal prediction 


sare al E(LX¢4n-1<0}Xtph-1 | Fe) + Pr Elo hhi | Ft) 


=a + 01 EloF na |F) t E(rz, s ae) ZI A1 | Ft) 


= ao + (a1 + 718z £1) 


E(02,5. 4 | Fe) 


Iterating this we obtain the formula 


E(07., 1, | Ft) 
h—2 


= œ 9 (e + mz + 61)? + (a1 + mz + Bi)" 


k=0 
h—1 


Elof n | Fe) + Bi EloF pni | Ft) 


E(07,1 | Ft) 


= ag (o1 + 719z + b1)" + (o3 + 710z B1)" (a + x<) X? + b102) 


k=0 


which is valid for all h > 1. 


Note. As for the GARCH(1, 1) model we see that under covariance stationarity, the predicted 
squared volatility converges to var( X+) for h — oo. 
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Review 


Solution 5.1 (Maximum domain of attraction) 

a) Fisher-Tippett-Gnedenko Theorem. 

b) Let X; ^^ F, i € N, and M, = max(Xi,..., Xa), n € N. Then F € MDA(H;) if there 
exist sequences (Cn)nen € Rso, (dn)nen € R such that 


Mn — dn d 
ZE a 


Ch noo 
or, equivalently, 
P((Mn — dn)/¢n € x) = F” (cnz + dn) Nun H(z) 


in all continuity points x of H. 
In words, F belongs to the maximum domain of attraction of He, € € R, if properly 
location-scale transformed block maxima of iid random variables of F converge in distribution 
to H. 
c) The fact that F € MDA(Hz) allows one to model maximum losses from an unknown loss 
distribution function F based on the limiting GEV Heg,,5(x) = He((x — u)/o), where u, o 
can be estimated from data. 


Solution 5.2 (Characterization of the MDA of a GEV for positive shape) 
By Gnedenko's Theorem, see MFE (2015, Theorem 5.8), F € MDA(H¢) for € > 0 if and only if 


F(x) = z-VL(z), 
where the function L is slowly varying at oo, that is 


Jim Lj =1, forallt>0. 


Intuitively, F € MDA(He) for € > 0 if and only if the survival function F of F behaves, up to 
a slowly varying function, like a power function for large zx. 
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Ex. p. 35 
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5 Solutions to Extreme Value Theory 


Solution 5.3 (Block maxima method) 

a) In the block maxima method (BMM), all iid observations, say N, are partitioned into k 
blocks of size approximately equal to n = [N/k]. For each block, the maximum of the 
(roughly) n observations is computed. These block maxima are assumed to approximately 
follow a generalized extreme value (GEV) distribution by the Fisher-Tippett-Gnedenko 
Theorem, see MFE (2015, Theorem 5.3), if the block size n is sufficiently large. The 
estimation of the GEV parameters £, u and ø is typically done by maximum likelihood; 
method-of-moments estimators are available, too. 

b) A particular challenge with the BMM is a bias-variance trade-off due to the choice of the 
block size n: If n is chosen rather large, the GEV distribution is a good approximation 
to the distribution of the block maxima (by the Fisher-Tippett-Gnedenko Theorem, see 
MFE (2015, Theorem 5.3)) which reduces the bias of the estimator. However, the number 
k of block maxima is forced to be rather small in this case (since N ~ nk is fixed), so the 
variance of the estimator increases. If n is chosen rather small, the bias increases (as the 
limiting GEV approximation is less accurate or even not valid at all), but the variance of 
the estimator decreases (due to more block maxima). Hence the necessity for a trade-off 
between bias and variance. 


Solution 5.4 (Excess distribution and mean excess function) 

a) The excess distribution function over the threshold u is given by 
Pu<X<a2+u) F(zx+u)-— F(u) 
P(X>u — I1-F(u) 
and the mean excess function is the mean of this distribution as a function of u. 


b) If X ~ Exp(A) with distribution function F(x) = 1 — exp(—Az), x > 0, it follows from (S1) 
that F,(r) = F(x) for all u > 0. 


c) By integration by parts, 
elu) = E(X —u|X >u) =) rAe "dy = [een] +f e ^* dz -[ e dz 
0 0 0 0 
1 
X 
Solution 5.5 (VaR, ES and the mean excess function) 
We show that 


Pax) = P(X - asa) X uw = (S1) 


ES4(X) = e(VaRa(X)) + VaRa(X), o € (0,1), 


where e(u) denotes the mean excess function. Since E(|X|) < oo, the mean excess function is 
defined by e(u) = E(X — u| X > u). We thus have that 
e(VaR4(X)) + VaRa(X) = E(X — VaR4(X) | X > VaRa(X)) + VaRa(X) 
= E(X — VaR4(X) + VaRa(X) | X > VaRa(X)) 
= E(X |X > VaR.(X)) = ES,(X), 


where the last equality holds since F is continuous; see MFE (2015, Lemma 2.13). 
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Solution 5.6 (Peaks-over-threshold method) 


a) 
b) 


Pickands-Balkema-de Haan Theorem. 


The peaks-over-threshold (POT) method uses the excess losses over a sufficiently large 
threshold u (chosen when applying the method). By the Pickands-Balkema-de Haan 
Theorem, see MFE (2015, Theorem 5.20), the excess losses over a sufficiently high threshold 
u approximately follow a generalized Pareto distribution (GPD), the parameters of which 
can then be estimated by maximum likelihood after u is chosen and the corresponding 
excess losses are computed. 


A particular challenge with the POT method is the choice of the threshold u: If it is chosen 
too large, the bias decreases (due to the more accurate approximation to the limiting GPD; 
see MFE (2015, Theorem 5.20)), but the variance increases (due to the smaller number 


of available excess losses over the threshold u based on which the estimation is done). 


Similarly, if the threshold u is chosen too small, the bias increases (as the excess losses 
over the threshold u may not be well approximated by the limiting GPD), but the variance 
decreases (due to a larger number of excess losses over the threshold u). 


'The POT method makes a more efficient use of the extreme data since it uses all extreme 
values exceeding a suitably large threshold and not just the maximum of each block as in 
the BMM (which neglects other sufficiently large values in those blocks or which might be 
too small to be considered extreme in comparison to losses in other blocks). 


Let X, X1,..., Xn ^* F denote losses and X1,..., X N, denote those N,-many losses among 
X1,..., Xn which exceed the threshold u. If u is chosen appropriately such that the excess 
loses X1 — u,..., X N, — u can be viewed as an approximate random sample from a GPD 
(so, approximately, Fu = Gg for some € € R and 6 > 0, where F, denotes the excess 
distribution function over u) then 


F(x) = P(X z)-P(X»wuP(X»z|X»u) 
= F(u)P(X — u > z — u| X > u) = F(u)F,(2 — u) 
7 y —ux-VE 
= F(u)(1+é [ ) "E it (S1) 


So the key idea is to express F(x) beyond u via F(u) (a term which can be estimated 
within the data) and F,(x — u) (which can be approximated by a GPD); see MFE (2015, 
Section 5.2.3). 


Note. The actual tail estimator can be obtained by (non-parametrically) estimating F(u) via 
N,,/n and (parametrically) estimating the GPD parameters £, 8 via maximum likelihood; 
see Smith (1987) and MFE (2015, equation (5.21)). Note that this estimator and (S1) are 
only valid for x > u. 
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5 Solutions to Extreme Value Theory 


Ex. p.36 Solution 5.7 (Threshold choice in the peaks-over-threshold method) 


a) 


From the mean excess plot of the losses on the left-hand side of Figure E.5.1, a threshold u 
of about 1 seems to be appropriate. The mean excess plot is fairly linear overall, so lower 
threshold choices would be possible, too, but we decided for the threshold choice u = 1 
here due to the slight kink at that value. This choice leaves enough excesses for applying 
the peaks-over-threshold method while larger choices of u seem to already be affected by a 
smaller sample size (the mean excess plot can become unreliable for large thresholds). 

From the mean excess plot of the real losses on the right-hand side of Figure E.5.1, a 
threshold u of about 0.05 seems to be appropriate. For smaller u, the mean excess plot 
shows several kinks and cannot be considered linear thereafter. After u — 0.05, the mean 
excess plot seems fairly stable, only affected by the small sample size for larger thresholds 
(see around 0.08 and beyond 0.1). 


Note. The mean excess plot on the left-hand side of Figure E.5.1 is constructed from an iid 
sample of size n = 10000 of G,/5.1/,9. Note that there is thus no theoretical reason for the 
slight kink at around 1 or the non-linearity for larger thresholds; this appears purely by 
chance and can be much more severe for smaller sample sizes. 

The mean excess plot on the right-hand side of Figure E.5.1 is constructed from the 
n — 430 positive losses of the 877 losses of the price of 1 Ethereum in USD between 6 
August 2015 and 31 December 2017. 


A Ge g has mean excess function e(u) = (8 + £u)/(1 — £), see MFE (2015, equation (5.13)), 
so the slope of the mean excess function of the approximate Gg g to Fu is £/(1 — €). The 
slope of the mean excess functions shown in Figure E.5.1 is roughly 1 and 1/2, respectively, 
which corresponds to GPD shape parameters € of 1/2 and 1/3, respectively. 


Note. A shape parameter of £ = 1/2 matches with how the losses for the mean excess plot 
on the left-hand side of Figure E.5.1 were generated, namely from a G1/21/19 distribution. 
A shape parameter of € = 1/3 for the plot on the right-hand side of Figure E.5.1 is fairly 
close to the maximum likelihood estimator 0.3062 (obtained numerically). 


Ex. p.36 Solution 5.8 (The tail index and its estimation) 


a) 


b) 


By Gnedenko's Theorem, see MFE (2015, Theorem 5.8), F € MDA(He) for € > 0 if and 
only if F(x) = az VE pa) for all x, where L is slowly varying at oo. The exponent 1/£ is 
the so-called tail index (and denoted by 6). 


There are a number of possible approaches. Simple methods include fitting a GEV to block 
maxima or a GPD to threshold excesses and taking the reciprocal of the estimator of €. 
There is also the regression approach. The main idea of the regression estimator is that 
log F(x) = —@log(a) + logc is a linear function in log x based on which one can estimate 0 
via fitting a regression model. 
Another method is the Hill estimator. Assume that the slowly varying function satisfies 


ind. 


L = c so that F(x) = cx? for all x. Let X,X1,..., Xn ^v F. The main idea of the 


Hill estimator is that for a sufficiently large threshold u, the mean excess function e of 
log X at logu is 1/0 based on which one can estimate 0 from the logarithmic sample 
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log X1,...,log Xn by using the corresponding sample mean excess function e, at log u for 
u = Xx» for a sufficiently small k € (2,...,n), where X1, > + > Xnn. To see this 
connection, use the formula for e derived in Exercise 5.19 with the survival function Fog x 
of log X given by Fig; x(x) = F(exp x) = cexp(—6z) to see that 


1 ud 1 [e 
e(log u) = Kol Frog x(x) dx = cec 3) [exp( dx)| 


œ l 
u 
so that 1/en(log u) is an estimator of 0. 


Note. In the general case (L slowly varying at oo but not constant), one can argue the same 
way by invoking Karamata's Theorem; see MFE (2015, Theorem A.7). 


Solution 5.9 (The extremal index) 
Let (X;);ez be a strictly stationary time series with stationary distribution F and let (X;);ez 


denote the associated iid process, that is X; ^" F, i € Z. One can often show that properly 


location-scale transformed block maxima of (X;);c7 converge in distribution to a non-degenerate 
limiting distribution H if and only if properly location-scale transformed block maxima of 
(X;)iez converge in distribution to H for some 0 € (0,1]. Such 0 is then known as extremal 
index; see MFE (2015, Section 5.1.3). 


Basic 


Solution 5.10 (Max-stability of the Fréchet distribution) 
We have 
P((Mn — dn)/en € £) = P(M, € cnt + dn) = P(X € cna + da,..., Xn € ent + dn) 
= F” (cnz + dn) = exp(—n(enx + diy) 


= exp(—(cnn7 V? + m7 V0q,) 9) 
which equals F(x) = exp(—a~®) for all a > 0 and any n € N if c, = n!/? and dp — 0, n € N. 


Solution 5.11 (A continuous distribution for which maxima do not converge to a GEV) 
Fx ¢ MDA(He) for € « 0 (Weibull domain of attraction) as Fx does not have a finite right 
endpoint. Also, Fx ¢ MDA(Ho) (Gumbel domain of attraction) as not all moments of Fx 
exist. Finally, by Gnedenko's Theorem, see MFE (2015, Theorem 5.8), Fx € MDA(He) for 
E > 0 (Fréchet domain of attraction) since Fx(z) = Fy(logz), so Fx(x) = Fy(log x) which 
cannot be written as a power times a slowly varying function at infinity. 


Solution 5.12 (Double logarithmic tail plot) 


a) By Gnedenko's Theorem, see also Exercise 5.2, 


(logz;, log F(z;)) ~ (logxi, —(1/£)log(z;)--logL(z;), i€11,...,n] 
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Ex. p. 37 


Ex. p. 37 


5 Solutions to Extreme Value Theory 


such that, for sufficiently large z;, the values —(1/€) log(x;) + log L(r;) in the second 
coordinate are approximately (up to log L(z;)) a linear function with slope —1/£ of the 
values log x; in the first coordinate. Therefore, for sufficiently large x;, the log-log plot 
becomes approximately linear with slope —1/€. 


b) Let Xi,..., Xn ®© F € MDA(H;), € > 0 (random variables of which z,...,2, are 


realizations) and Xin 2 ++: 2 Xnn be the corresponding decreasing order statistics. 
Furthermore, let F(x) = x-VL(x), where L is a slowly varying function at oo. Moreover, 


let k be the largest index such that Xk n > u. If F denotes an estimator of F, a) implies 
that the points 


(log Xin, log F(X;5)), i€(1,..., k], 


roughly lie on a line with slope —1/€ and intercept log L(x). For F being the empirical 
survival function 


Eo: bs ig 
F(a) = — 5 Ipae) = 72 Ipso) 
l=1 l=1 


(where we used ‘>’ rather than ‘>’ to be less wasteful with the data) we have F(X;4) = i. 
Therefore, fitting a regression model to 


(log Xin, log(i/n), $£€1L...,k), 


and taking the negative value of the estimated slope provides a semi-parametric estimator 
for the tail index 0 = 1/€. 


Ex. p. 37 Solution 5.13 (Uniqueness of limit up to location and scale) 
For the given c, = F(1— 1/n) = «((1 — (1— 1/n))- V — 1) = «(nl/e — 1) and d, = 0, we 
obtain that 


F” (cng + dn) = (1 (— uaa 1)) a) 


“(> Gres) E (erectam) ) 
=(1 E e 


n 


28 


> exp(-z ")— Hysiis(z) © > 0. 


n—»oo 


We thus have location u = 1 and scale e = 1/a. 

Note. 

1) For a justification of the last convergence, see the hint in Exercise 5.23 a). 

2) The two limiting distributions Hija = H1/4,9,; and Hj/4 1/4 share the same shape parame- 
ter € = 1/a but differ in the location and scale parameters u and c. By the Convergence to 
Types Theorem, this is the only way two limiting GEV distributions can differ and one can 


always choose normalizing sequences such that the limiting GEV appears in standardized 
form, that is as Hc (with u = 0 and o = 1). 
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Solution 5.14 (GPD with negative shape parameter) Ex. p. 37 
For € < 0, Ge g is concentrated on [0, —8/£] and thus the distribution of Y is concentrated on 
[0,1]. For y € [0,1], we have that 


Fy(y) = P(Y € y) = P(-£X/8 € y) = P(X < —8y/£) = Gea(—By/E) 
(1+ £(—8y/0)/8) "E 21- (1 — y) * 


tees 
¿<0 
with corresponding density 
1-1 (1 = y) V&-1 yd = y) V&-1 


EM: — ,\-1/é-1 _ Y _ 
fv()- z0 y) = -E - PR ^ 


where we used that 


TUONO — TCUO 2. 

Tr(1—1/£) | (-1/9T(-1/£€) 
Therefore, Y ~ Beta(1, —1/£) with density fy(y) = y^^! (1 — y)-1/8(a,b), y € (0,1), fora = 1 
and b = —1/£ > 0. 


B, -1/€) 


Solution 5.15 (Slow variation, regular variation) Ex. p. 37 
a) We have to show that limz-... L(tx)/L(x) = 1 for all t > 0. 
i) Since 
c uix) . 2+ cos(1/(tx)) 2+ cos(0) 
2-309 L(z) 2300 24 cos(1/z) 2 + cos(0) / je S 


L is slowly varying at oo. 


ii) Since 
L l l l logt 
im ES) = lim ECD = im SBOE? = tim (SÉ 41) =1, #0, 
zoo L(x) r>% log ax 2-00 log x «oo \ log x 


L is slowly varying at oo. 


iii) Since 
E 
Eas L(tx) Sn log x -( : se) tp psi 
zoo L(x) 2-00 log(tax) zoo loge ii) 
L is slowly varying at oo. 
iv) Since 
L(t L(1 L(x(1/t 
e ERAI MENTA 
zoo L(x) s>% L(1/x) a>0+ L(x) 


L is slowly varying at co. 
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b) We have to show that lim; ,s5 h(tx)/h(x) = t* for all t > 0 and some a > 0. 


i) Since 
0 
Co) NT t°, t» 0, 
q2—00 h(a) z—00 g~? 
h is regularly varying at oo with index a = —0. 
ii) Since 
_ h(tz) |. (14 tz)? ( zm p" 
= l —— > — = 
AER ese)? wisi) =o 99 
h is regularly varying at oo with index a = —0. 
iii) Since 
h(t hf h(a(1 
hm LS qu MUTO quu NEUEM age. desi 


h is regularly varying at oo with index —a. 


Ex. p. 37 Solution 5.16 (GEV limit for maxima from the GPD) 
The generalized Pareto distribution (GPD) is given by 


1—(146z/8)-U5, £7 0, 


Gea) = l —exp(-2/8),  €=0, 


where f > 0, and the support is x > 0 when € > 0 and x € [0, —8/£] when € < 0. 
i) Let €>0. Then Ge g(x) = (14 £x/8) V6 = z-VSL(a) for L(x) = (€/8 + 1/2) V6, x > 0. 
Since 


L(t) | , (£/8-1/(tz) V5  . (&/gy-V6 


JU La) 4e (E/B + 1a E T 23% (eyp)ie D I7 


L is slowly varying at oo and thus, by Gnedenko's Theorem, see MFE (2015, Theorem 5.8), 
Ge g € MDA(H,;), € > 0. 


ii) Let X1,..., Ao. Go,g = Exp(1/8). For cn = 6 and dn = logn, n € N, we have that 


O E E E E eto d (1 tese (ene Ë %0) 


—a—logn\ n —2“\n 
= (1 — — = (1 E ) > exp(-e ^) «ER, 
n n noo 


which equals Ho(x), so that, by definition, Go, € MDA (Ho) for all 6 > 0. 
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iii) Let € < 0 and note that the right endpoint zc, , of Ge g is £G; , = —3/€ in this case. Then 
Ge,a(@aeg — 1/2) = (1+ £(-8/€ — 1/2)/8) VF = (—E/(a)) 5 = FL (a) 


for L(x) = (—£/8)-V5, y > —£/8. Since L is constant, thus slowly varying at oo, 
Gnedenko's Theorem, see MFE (2015, Theorem 5.10), implies that Gg g € MDA(He), 
€ « 0. 

Solution 5.17 (GEV limit for maxima from the Burr distribution) 

Note that F(x) = 1 — F(x) = (k/(&-4- z&))^ = x-9^L(x) for L(x) = (ER x > 0. Since 


K A A 
502) - (m SERES -(i) =1, t>0, 


— 
EIS k/x?+1 


ji 
226 L(x) 


L is slowly varying at oo and thus, by Gnedenko's Theorem, see MFE (2015, Theorem 5.8), 
F € MDA(H¢) for € = 1/(02). 


Solution 5.18 (The Weibull distribution as a special case of the GEV) 
a) We have 
Fy(y) = P(Y < y) = P6X € y - 1) = P(X > (y = 1)/8) = 1 — He((y — 1)/6) 
= 1 — exp(-y- 5) 

for all y such that (y — 1)/£ < —1/£, so y > 0. 

b) By a), the density fy of the distribution function Fy of Y is 
1 1 
fv(y) = pS exp(—y 8) = B x5 exp(-y 5), y > 0. 
By comparison of fy with the claimed form of the density, we see that c and y have to be 
chosen as c= 1 and y = —1/£. 


Solution 5.19 (An alternative formula for the mean excess function) 
a) Since E(|X|) < oo and thus fj" zdF(z) < oo, we have that 


Eyes opea mre Ep) J. ETOR / par) n 6H) 


£ TTF 


Furthermore, for all z € [0, zr — u), we have that 


F(x+u)— F(u) 1-Fle+u) _ F(z +u) 


F(z) =1—F,(z) = = = S2 
(2) (v) 1— F(u) 1- F(u) F(u) PA 
With the substitution y = x + u, we obtain that 
7 F F - 
lim ux) - lim jb = lim (y— u) C MN ULT 0. (S3) 
z—mTrp—u x—rp—u F(u) y—cz4u yup F(u) (S1) F(u) 
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Now fix u € [zp, rp). Then, for all x € [0, zp — u), 


e(u) = E(X —u| X >u) = ec) = -[^ zaf) 


H d a) de S i x de zu F m L iud 


b) = Let € € (0, 1). Part a) implies that for u € [0,00), 


1/6 
= E M Pis =a ue a aa 
)- V6 l (0-(1+ 1-1/€ 
A d T ogg OTA MN NUN 
a i" (1 + &u/8) V6 =e 
_ Bc £u 
ap 
=» [et £ — 0. Part a) implies that for u € [0,00), 
sy qui? iy eI de = B = ze. 


= Finally, let £ < 0. Part a) implies that for u € [0, —5/£), 


—B/ 
elu) = (e eu) f oet) as 
B 4- £u 


P ] 7 
zu + 6/8) ETE (1+ £u/B) 13) = Wen 


Note that in all three cases, e(u) — (8 -- £u)/(1 — £) for all u between the left and right 
endpoints of the distribution function of the Gg which verifies MFE (2015, equation 5.13). 


Ex.p.38 Solution 5.20 (Scaling of VaR, ES under a power tail) 
a) Let u € z€y «oo. Then 
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With x = F* (o1) = VaRa, (X) and y = F* (o2) = VaRo4(X) for F(u) < o1 € ag < 1, 
one obtains that 


VaRa,(X) = (= 


Since F(a) is continuous for x > u, F(F“(a)) = 1—F(F*(a)) = 1— a for all a € (F(u), 1) 
and thus 


b) Let 6 > 1. We know from a) that VaR;(X) = (2202) ig + VaRo,(X)) — & for all 
F(u) <a, € z < 1 and thus 
1 


1 
ESa (X) = = x VaR4(.X) dz 
a2 


1— 1/0 pl 
(k + VaRa, ao i AU Í (1— z)-V* dz — & 
2 a2 


(1 es 1 1 


(1 2 -K 


= (k + VaRa,(X)) 


1 — 02 1— 1/0 a2 
= (l-ai)'/? 1 1-1/0 
= (k + VaRo, (X)) =e 1-1/6" a2) K 
= l-a, 1/0 0 
= (r VaR GO) (124) T5 


c) Assume we have losses following F with power tail F(x) = c(& -- z)-9, x > u. The scaling 
results for value-at-risk and expected-shortfall allow us to estimate VaR,, ES q, at lower 
confidence level o4 for F(u) < o4 € ag < 1 and to compute from there results for confidence 
levels ag > o in order to obtain estimators of VaR44,, ES4,. This way one can even obtain 
estimators of VaR4,, ESa, for confidence levels ag corresponding to quantiles beyond the 
largest data point, where estimation would otherwise not be possible. This ‘estimation 
in the body of the data and then scaling up' lies at the core of statistical applications of 
extreme value theory. Note that this is only meaningful if indeed F(x) = c(k + x) ^9, x > u. 


Solution 5.21 (Block maxima method applied to S&P 500 return data) Ex. p. 39 


a) The maximum likelihood estimator Om of 0 = (£, 1,0) is Om = (0.5278, 0.0223, 0.009), 

where m denotes the number of yearly block maxima. Since, for € > 0 and X ~ He m0, 
E( X^) = oo if and only if k > 1/£, moments k > 2 of the fitted GEV do not exist, so the 
fitted distribution hints at an infinite variance model. 


b) Let Mn rn He,, and zmax be the maximal loss over the past 20 years. Then 


P(M,, zt iss) =1- Hg. usse) = Ho, (max), 
which, based on the max = 0.0711, can be computed as 7.95%. 
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Ex. p. 39 Sol 


Note. Asking for the exceedance probability of all previous losses would result in the much 
smaller probability of 0.83%, because of the large loss on Black Monday, 19 October 1987. 
The k n-block return level rp, is given implicitly by P(Mn > rn,k) = 1/k, that is the 
(smallest) level which is expected to be exceeded (at most) in one out of every k blocks 
of size n. Since in the block maxima method, approximately, M, ~ He,, we have 
1/k =P(Mn > ra) = He, (rk) and thus fn, = Hg, (1/k) = Hg (1 — 1/k). Based on the 
fitted GEV distribution, we obtain 260,10 = 6.32% and 260,50 = 14.50%. 

The return period knu of the event {Mn > u} (that is of a loss at least as large as 
u) is the smallest number of n-blocks for which we expect to see at least one n-block 
exceeding u. Since in the block maxima method, approximately, Mn ~ He, we have 
l/k,, = P(M, > u) = He, (u) and thus kn», = 1/Ho,,(u). Based on the fitted GEV 
distribution, we obtain k260,0.0922 = 20.78, so a return period of about 21 years of a loss as 
seen on Monday, 29 September 2008. 

Note. Although the return period describes an average behavior rather than that the event 
considered happens exactly every so-many years, the latter is, by chance (!), indeed the 
case here. 21 years prior to 2008 was the last time an event with a loss at least as large as 
9.22% happened, it was the Black Monday event. 


ution 5.22 (Peaks-over-threshold analysis of Bitcoin data) 


R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial. org. 


Advanced 


Ex. p. 40 Sol 
a) 
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ution 5.23 (Alternative condition for convergence of normalized maxima) 
Let £n = Cnt + dn and z = — log H (x). We thus need to show that 


Pax) MA exp(—z) if and only if nF (24) an z. 
By applying log(-), we obtain that the left-hand side is equivalent to 


nlog(1— nF(zn)/n) = nlog(1 — F(zn)) = nlog F(zn) = log F” (£n) fcrc 


and, trivially, the right-hand side is equivalent to —n£ (z,) E T We can thus equiva- 
lently show that 


nlog(1— nF(an)/n) ieee if and only if —nF(z,) > —z. 


noo 
With an = —nF (xn) and a = —z this is equivalent to 
nlog(1 + an/n) noo d if and only if an VS 


which holds by the hint. 


Advanced 


Note. The hint can be shown as follows. Let an —^ a € R for n — co. First consider a = 0. 
For |an| € 1/2, one has 


n 


n n n—1 
» (esr s Y (tout la ea 
k=1 k=1 k=0 


I + as /n)^ — 1| = 


k=1 
n—1 
< Jan] 3, (1/2)* < 2lan| 0 
k=0 


o (1 +an/n)” — 1 = e? for n — oo in this case. Now consider a 4 0. By l'Hópital's rule 
(case ‘0/0’), lim; ,olog(1 + z)/x = 1. This implies that 


log(1 +an/n) _ an log(1 + an/n) sa 
ajn a an/n noo 


and thus that log((1 + a4, /n)") > a for n — oo. Continuity of the exponential function 
then leads to (1 + a;/n)" = exp(log((1 + a; /n)")) > exp(a) for n — oc. 

Conversely, let (1 + aj, /n)" — e° for n > oo and note that an/n — 0 since, otherwise, 
there is a subsequence (an,) C (an) along which a,,/n is either positive or negative and 
bounded away from 0 so that exp(a) — Jim (1 + an, /ni)"* € {0, co}, a contradiction to 
a € R. Furthermore, for x > 0, 


oo (—])kH 1H 
log(1 + z) = m Lg t=r+5 > gk = grol) 


k=1 
and thus, by assumption, 
a = logexpa = lim nlog(1 +an/n) = Jim n(an/n + o(as /n)) 
= i an+ Ji no /n) = Jin an+ Ji ano(an/)/(an/n) = Jim s 


where the last equality follows since (an)nen is bounded and o(a,/n)/(an/n) — 0 for 
n oo. 


b) Since 
nF (cz + dn) = nexp(—A cnt + dn)) = nexp(—(z + logn)) = exp(-«) = — log Ho(x), 
for Ho(x) = exp(—e~”), x € R, we infer that F € MDA(Hpo), that is F belongs to the 


maximum domain of attraction of the Gumbel distribution. 
Alternatively, this can also be shown directly here via 


F" (cng + dn) = (1 — exp(-A(exz + dn)))" = (1 — exp(-A(z/A + log(n)/A)))" 
—-(ü-e*/nf > exp(-e ") = Ho(z), v €R. 
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Solution 5.24 (Maximum domain of attraction of a log-gamma distribution) 
Since f (and thus F) has a power tail, an idea would be to apply Gnedenko's Theorem, see 
MFE (2015, Theorem 5.8), but we do not know F analytically. To this end, we first show that 
f is regularly varying at oo and then use Karamata's Theorem. 
To see that f is regularly varying at oo, note that 
[07 


Fla) rs logo) ia- = a P(e) 


for 


L(x) = T(a) (logx)*1, z»1. 


The function L is slowly varying at oo since, for all t > 0, 


2" (log(tz))*-! a-1 a-1 
mu UH ue Fe Ga) (aim Hee) = (1m ae” +1) =i, 
LOO L(x) £= 00 Tay (log z)e7! LOO log x zoo log x 


Therefore, f is regularly varying at oo. 
For fixed a, B > 0, note that L is locally bounded on [xo, oo) (for zo = e, for example). By 
Karamata's Theorem, see MFE (2015, Theorem A.7 (b)) for k = —(8+1) < —1, 
oo 
T 


F(x) = | ” faz = if 2 OT (2)de ~ ——a FP L(x) = a7? L(x) 


where a(x) ~ b(z) as x — oo means that limz—oœ a(z)/b(x) = 1. Since L(x) = L(x)/ is slowly 
varying at oo, Gnedenko's Theorem implies that F € MDA(He) for € = 1/6 > 0. 


Solution 5.25 (Maximum domain of attraction under serial dependence) 
a) We have that 


P(Yi € x) = P(max(Xi, X2} € z) = P(X € z, Xo € x) = P(X1 € x)P(X» < x) 
-—PFr-1-—az*, si 


Thus Y; follows a Pareto Type I distribution with parameter 0 > 0. 


b) Let Fo(x) = 1 — x7’ denote the distribution function of a Pareto Type I distribution with 
parameter 0 > 0. With c, = Ff (1 — 1/n) = nV? and dn = 0, n € N, we have that 


-nn 
P(MY < Vs) = FP (na) = (1 — (nz) = (1-2) g exp(-2), 


so HY (x) = exp(-z *) = Hi j9,3130(2), x € [0, 00). 
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c) We have that 


P( 4, < x) = P(max{ X1, Xo} < x) —P(Xq € xz, Xo € x£) = P(X: < P(X < x) 
= F?(£)=1- r, zzi 


Thus, as Yi, also Z4 follows a Pareto Type I distribution with parameter 0 > 0. 
d) Forz>1, 


P(Zn Sa, Zn+1 < x) = 


P(max{ Xn, X441) € 2, max{Xn41, Xn+2} < x) 
= P(Xn < fA < £, X254) 
= F?(x) x F'(z) = P(Z, € £) P(Zn41 < 2), 


so, although identically distributed, (Zn)nen is not a sequence of independent random 
variables. 


e) With c,,d, as in b), we have that 


P(MZ < nt? x) ep nV y. d nt? x) =P(Xı < n’ g, TO EE nV) 
= pti) zz (1 = (nV/$5)-9)0 0/2 


6s (n+1)/2 
-(-£—) > eer», 
n n— oo 


so H4(x) = (exp(—-z-9))!/? = B rnt = HY (x)! x € [0, oo). 


f) Clearly, H7 = (HY )!/?. The exponent 1/2 is the extremal index. 


Solution 5.26 (A link between the Poisson, GPD and GEV distributions) 
By the Law of Total Probability, 


Ex. p. 41 
P(My < x) = » (My €z,N —mn)-— Y PM, gH) 
n=0 n=0 
x P(M, € z)P(N = n) = pu A P S ge Yn < x) 
EAD AG 
- E? <a)" D eo exp(—A + AGe,s(z)) 
m _ fexp(-A  &x/8)7 5), £0, 
= exp(-A(1 — Geg(x))) = ded B=), (S1) 


We have left to show that (S1) equals H¢ „o (x) for the given u and ø. First consider € ¥ 0. 
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He yo (x) = He((x — u)/o) = exp(- (1 + E(x — u)/o) S) 
exp( (1 f= xU ir 2 = exp( (1 + 


BAS 
Cs 


éz 2 + J 5) 


ne = exp(—A(1 + £x/ B) V6), 


For € = 0, 
Heu) = exp(—e- (*-#)/7) = exp(—e7 (778 lee 01/8) 2 exp(—Ae~*/8), 


In both cases, we see that the distribution function of My in (S1) equals H;,,,; for the given u 
and o. 

Note. This is the ‘exact’ analogue of limiting results we know to hold under more general 
conditions: Let F be continuous and F € MDA(H;). Since (Xn)nen is a sequence of iid random 
variables, the exceedances X;,,..., Xi, and thus the excesses Y; = Xi, — u, k € (1,..., N}, 
over a threshold u happen according to a Poisson process, so N ~ Poi(A) for some A > 0. The 
corresponding excess distribution function Fy over sufficiently large u is approximately GPD 
with shape € and scale 8 (by the Pickands-Balkema-de Haan Theorem since F € MDA(H;)). 
And since F € MDA(H¢), My approximately follows a GEV distribution for sufficiently large 
N (by the Fisher-Tippett-Gnedenko Theorem). 


Ex. p. 41 Solution 5.27 (Convergence of the excess distribution for a gamma distribution) 


a) The density f of X is f(x) = 1?/T(2)z? ! exp(—1- x) = xexp(—z), x € (0,00), and thus 


F(x) = f no dz = | zexp( 2. ] ewa dz 
—x exp(—x) + 1 — exp(—x) = 1 — exp(—z)(1 + x), 


x > 0. 


Therefore the excess distribution function is 
F(a +u)—F(u)  —exp(-(z-Fu))(1- 2+ u) + exp(—u)(1 + u) 


Fy(a) = 1— F(u) E exp(—u)(1 + u) 
exp( nu = u)+1+u = 1 — exp( P Te x € [0, 0). 
b) We have that 
lim E |F (£) — Goa(a)| = jim, seus 1 — exp(—zx) 1 LI u (1 — exp( »y) 
= Him, sp ewo (i - 5) 
= dim, sup Jexr(—2) >, 
= lim sup _wexp(—2)/(1 +u) = Jim e™?/(1 +u) =0, 
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where we used that x — xexp(—2), x € (0,00), attains its maximum at x = 1. 


c) By the Pickands-Balkema-de Haan Theorem, see MFE (2015, Theorem 5.20), we conclude 
that F € MDA(Hp), so Ga(2,1) belongs to the Gumbel domain of attraction. 


d) By Exercise 5.19, 


OO Ge 1 
e(u) i (2) de = 


i exp(—z)(1-- 2+ u) da 


= is arro epla] ef. exp(—z) de) = lity +1) 
_2+4u 
EET 


Since limu elu) = 1, the mean excess function of X ~ Ga(2,1) converges, for large u, 
to that of an Exp(1) distribution which corresponds to Go. Thus the excess distribution 
function Fi, must converge to Go. By the Pickands-Balkema-de Haan Theorem, see MFE 
(2015, Theorem 5.20), we conclude that F € MDA(Ho). 


Solution 5.28 (Shortfall-to-range-shortfall ratio for a GPD tail) 


a) For all z € [u, zr), we have that 


F(x) = P(X >x) =P(X > u)P(X > z|X >u) = F(u)P(X -u > z,z -u| X >u) 


FoRo aa Bay [056 0875, E40, 
SEQUI ey fae ae £- 0. 


Solving F(x) = a for x for all a € [F(u), 1) leads to 


8 dea)~§ _ 
wan-r e [RS 
F(u)? = 


Let 0 «0; < ag < 1. With 


[a-atas- [3:a- 2-4 - Sed bea 


o1 


and 


Te log(1— z) dz = — o log y dy = — [y(log(y) = 1) e 


o1 1—o1i ay, 
= (1 — aj) (log(1 — o1) — 1) — (1 — ag)(log(1 — a2) — 1) 


we obtain from linearity of the integral that 


LB 1 1 a2 
ES Ub] = , E (zb O3—01 ie 


( 
u— xu fo? log(1 — z)dz — log F(u), €=0, 
B , B F(u)* Q-a1)t-£-(1-a2)t-€ 
LI 4 E +g 1-6 a m à a) 1). a Xdog(i jc € #0, 
u — log F(u) — Be eacus EC cei eed) 
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b) Let £ <1. For o4 = a and a2 — 1-, we obtain that 


B | B FQ)* (1-0)! 75 
ESa(L)= lim ESa,œ(L) = I eee: EET us 
02—1— Ur log F(u) B l—-« , € =U, 
F(u)* = 
_ Ju- 84+ 85820 -a)-, € 7-0, 
u — log F'(u) — B(log(1 - a) -13), £—0, 


where we used that lim; ,94 «log x = lim,_,9 log(x) /(1/x) = 0 by l'Hópital's rule (case 
*—oo/ oo). 


c) To keep the notation simple, introduce 


Consider 0 Z € « 1. By a), b) and the fact that 1 — ag = 1 — (1 — (1 - o)") = (1— a)”, 
we obtain that 


pm ESa(L) EE a 4- b(1— o)-$ E a + ba-* 
HL ESL erus) T ee veas e esp 7 aM a ga Can 
$ a + ba7& 
= 1m . 
a0 q + bag laa 20-9 


For € « 0, we immediately see that this limit is a/a = 1. For € € (0,1), we have that 


lim a + ba $ ari aas +b bo i 
o04- a I bas ea D E o04- aos + pica oe b i 


Now consider the case £ = 0. By a), b) and the fact that 1—a2 = 1—(1—(1—0)?) = (1-a)’, 
we obtain that 


a+ 6+ blog(1— o) 


ases 1—o)(log(1—o)—1)— (1—o)? (log((1—o)Y)—1 
o1-(1-o)(L) >I- q + p&o eet g((1—a)»)—1) 


ji a+6+bloga 
T aD q 4 patios (a)- Dan loela) 
NE a+6+ bloga 
p coim ploela)=1- a1 tog(a)=1) 
+8 
= RE "E 
B €— a p1 ise a at 3 (y OR ü b xm 
logo d 1—-o?-1 
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Concerning the interpretation, if £ < 1 (so ESQ(L) exists), then, asymptotically for large 
a, truncating ESQ(L) at 1 — (1 — o)? > a, y > 1, does not affect the value of ESa(L) much. 
As such, for large a, one can also work with the range expected shortfall ESa 1—-(1-a)7 (L) 
for any y > 1, which has the additional advantage of always being finite, even for € > 1. 


Solution 5.29 (Probability-weighted moments for the GEV distribution) Ex. p. 42 
Note that 


ut £((7lgy) *—-1), £70, 


F~ (y) = Hipo (9) = L — alog(- log y), Sc 


a) First consider 0 Z £ « 1. Then 


Br = Miro - E(F-(U)U") = | u+ S logu) 5 — Iu" du 


=u exp(—a(r+1)) dx 
0 


Rt Tu z 5 exp(—a(r + 1)) dz — fe exp(—z(r + 1)) de) 


= H of 1 l y xc 7 1 ) 
EM aa , aa exp(-y)dy - —4 


PER. e NU 
E c au sD) 


_ wt Er 1ErQ - 6 - 1) 


—]. 
r41 y r> 
For € = 0, we obtain that 
1 u oo 
B= f (u—clog(—-logu))u'du = -of log(x) exp(—z(r + 1)) dz 
0 z—-—loguT-d-1 0 
L- fi togty/(r +1) exp(-y) 4 
= = exp(— 
y-x(r41) r+1 r+1l1Jo By Pee 
AN VE (y) exp(-y) dy —1 (+1) 
Tri PEIN ogly) expi—-y) dy — Log (rT 
m o pt o» + log(r + 1)) 
= l +1))= : s 
uu a ee Se r> 


note that this also follows from the case 0 4 € < 1 by l'Hópital's rule (case ‘0/0’), using 
that lime. ,9 sr — £) — PA = y. 
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b) By a), it is easy to verify that 


i a O4E<1, 


Jt oy, & = 0, 
er(1—é(2-1), 04€<1, 
t c6 
EE s £ —0, 


lg3 — £ — Q. 


HE 
362-60 _ Jaap OFE<1, 
28 — Bo To 


By b), 


vo- HS 0zE«1, 


log3 
log 2> €=0, 


equals (382 — 8o)/(201 — Bo). Based on estimates Bon, Bin, Gan of Bo, £1, B», we thus 
obtain an estimate £j of £ via 


By b), estimates 44,04 of ui, 0 can then be obtained via 


281,n—Bo,n 
 jraü-éjge-pjee OF && <1, z 
n= 201.5 —D0,n = ( ) 
- lg2 ^" En = 0, 
iex Bon — E-E) 1), OF & «1, T 
Bo, — Onfy, En — 0. 


An algorithm can thus be given as follows. 

1) Compute the estimate Gon, Gin, P2, of Bo, £1, £a. 

2) Compute the estimate £, of € via (S1). 

3) Based on £n, compute the estimates on and un of o and p via (S2) and (S3), respectively. 
Note. According to Hosking, Wallis, and Wood (1985), 8, can be estimated via 


1«(k-1)--(k-r 


Uo ser] 


(n—1)---(n—rm 


based on the order statistics z(j € +++ X Xm) of the data z1,..., £n and where the empty 
product is understood as 1; by Landwehr and Wallis (1979), the estimator is unbiased. 
To obtain £, explicitly, Hosking, Wallis, and Wood (1985) suggest to approximate h-!. 
The approximation suggested is equivalent to h~!(y) = a/y? + b/y + c for a = —2.9554, 
b — —4.1297 and c — 3.782014. 
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Solution 5.30 (Probability-weighted moments for the GPD) Ex. p. 42 
Note that 
Be aah se 
y , €F0, 
F-()) = Ga) = I d 
—flog(1 E y), e = 0. 


a) First consider 0 4 € < 1. Then 


as = Mı o,s = E(F* (U)(1— U)’) =E fi 


(G 
Ca paoata (1 —u)* du) 


== (|r! 228 a ese "D - Ces en) 
8 é B 


= ~G=te hen) eae 


For € = 0, integration by parts leads to 


Qs = -f log(1 — u)(1 — u)? du 


2 o(a u)**log(1 4| fa u)* du) 


B [ i eee ect Em 
EIS ieu) du- cg 0-9 pee 


note that this also follows immediately from the case 0 Z € < 1 by letting € — 0. 
b) By a), it is easily verified that 


A CNN ES: Bu c p l 
Jet Opes" dese eE eue 
Therefore, 
“0 _=2 E SP (Sank eut 
re TOE 
and 


B B 
2ogo1 _ 2TA _ Te 


ao — 201 


B 
(1-£)2-£€) 1-£)2-£) 
c) By b), estimates £n, £5 of £, 8 can be obtained via 


(ymo ERO — (S1) 


Qn, 0 — 2an, 1 
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and 


200 nQ1,n 


Bn = (S2) 


Q0,n — 201, 


An algorithm based on b) can thus be given as follows. 

1) Compute the estimates 09,4, 01,5, 02,4 Of ao, 01, Q2. 

2) Compute the estimates £, and Sn of € and £ via (S1) and (S2), respectively. 
Note. According to Hosking and Wallis (1987), a, can be estimated via 


P. RARO queen 


Q m 
id n c (n—1):--(n—s 


based on the order statistics z(jj € -++ € Xn) of the data z1,...,, and where the empty 
product is understood as 1; by Landwehr, Matalas, et al. (1979), the estimator is unbiased. 
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Review 


Solution 6.1 (Sums of normal random variables) 

The statement is not correct in general in its current form as it does not specify the dependence 
between Lı and Lo. In particular, only if Lı and Lə are jointly normally distributed, so 
(Lı, L2) is bivariate normal or, equivalently, (L1, L3) ~ Na(qu,X) with w = (u1, u2) and 


2 
x- ( 71 pp do we know that 


p0102 o3 


L 1 
fx: 3 pex op. 5 oo ae = N( u1 + u2, 02 + 2po402 + 02) 
Lo Ha 1 


and thus, only if p = 0 (so in the uncorrelated or, here, independent case), that Ly + Lz ~ 
N(u + uo, a1 + 03). 


Solution 6.2 (Independence of uncorrelated normal variables) 

If X;,..., Xa are uncorrelated, their covariance matrix X is © = diag(o?,...,09). Then 
X = AA’ for the Cholesky factor A = diag(o1,...,04). Since X allows for the stochastic 
representation X = u + AZ for Z = (Zi,..., Za) ~ Na(0, Ia), we immediately obtain that 
Xj = uj +0;Z;, j € {1,...,d}, and thus that Xj,..., Xa are independent. 

Note. Alternatively, ifo; > 0, j € {1,...,d}, E = diag(o7, ...,02) has inverse X! = diag(1/o7, 
..., 1/02) and thus (æ — pY Et (æ — u) = 3$ (5 — pj foj. Therefore, the joint density fx 
of X equals the product of the marginal densities fx,, j € (1,..., d], since 


fxle) = sapra (ze n'a) 
"eene CRI e) 


d 
= II fx; (x3), LE R4. 
j=l 


Yet another approach is to show that the characteristic function of X is the product of the 
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Ex. p. 43 
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characteristic functions of X1,...,Xq. This holds here since 


1 1 Ts 
x(t) = E(exp(it' X)) = exp (wu — ;t2t) — exp (wu — St dtt) _= exp (wu — st?) 
i=A't 


d d 


d 
= exp (Do Gnt = 8m) _= I exp(iu;t; - (o;5?/2) = [] 4x, (t;), tem. 
j=l 


=i Cx MET: 


Solution 6.3 (Stochastic representation for normal variance mixtures) 


a) X 2 nt VW AZ. 

b) As in Exercise 6.2, if X1,..., Xq are uncorrelated, X = diag(o2,...,02) (a covariance matrix 
but in general not the one of X = (X1,..., Xa); see MFE (2015, equation (6.19)) for this 
important fact) is diagonal, and thus its Cholesky factor A = diag(oj,...,0 4) is diagonal. 
We thus have that 


Xj = pj t0; W Zj, j € {1,...,d}, 


so, intuitively, the components X1,..., X4 of X are dependent (through the same random 
variable W) unless W is constant almost surely (in which case X follows a multivariate 
normal distribution). 


Solution 6.4 (Normal variance mixtures as elliptical distributions) 
X ~ Ma(p, X, Fw) allows for the stochastic representation 


X=p+VWAZ, 


where W ~ Fy independently of Z ~ N;(0,J;) and where A € R^*^ satisfies AA’ = X for 
some k. Therefore, 


X =ptVWAZ = pt VWZA|ZI|/IZ| = w+ RAS, 


for R= /W|Z|| and S = Z/||Z||. Note that R > 0. By MFE (2015, Corollary 6.22), R = ||Z || 
and S = Z/||Z|| are independent. Furthermore, since Z ~ N4(0, 74) is spherically distributed 
(see MFE (2015, Example 6.19)) it follows from MFE (2015, Corollary 6.22) that Z/||Z|| is 
uniformly distributed on the unit sphere in d dimensions. Therefore, normal variance mixtures 
are elliptical. 


Solution 6.5 (Distinguishing distributions by scatterplots of random samples) 
a) The middle-right plot. 

b) The middle-left plot. 

c) The top-left plot. 

d) The bottom-middle plot. 

e) The top-middle plot. 
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f) The bottom-left plot. 

g) The middle-middle plot. 
h) The bottom-right plot. 
i) The top-right plot. 


Solution 6.6 (Advantages of generalized hyperbolic distributions) 

Generalized hyperbolic distributions have a theoretical motivation in that they may appear as 
increments of multivariate Lévy processes for modelling prices in continuous time. They are 
particularly tractable, since the distributions of linear combinations are always of the same 
type. They can accommodate heavy tailedness and skewness. They can easily be estimated 
using variants of the EM algorithm due to their structure as mean-variance mixtures of normals. 
See MFE (2015, Section 6.2.3). 


Solution 6.7 (Different kinds of factor models) 
In macroeconomic factor models the time series of factors is observable. In the context of 
modelling equity return series it may for instance be a series of observed market index returns. 
Models are estimated by time series regression. 

In fundamental factor models factor time series are constructed from time series of risk-factor 
returns by categorizing these return series according to fundamental factors, such as country, 


industry and size in the case of equities. Factors are then estimated by cross-sectional regression. 
In statistical factor analysis no preconceptions regarding the nature of the factors are applied. 


Instead factors are discovered by statistical estimation. A common method is to find the factors 
by carrying out a principal component analysis (PCA). 
See MFE (2015, Section 6.4). 
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Solution 6.8 (Independence of uncorrelated Bernoulli variables) 
Clearly, independence implies uncorrelatedness. For the converse, note that 


P(X, =1, X2 = 1) = E(X1 X2) = cov( X1, X2) + E(X1)E(X2) = 0 + E(X1)E(X2) 


I 
5 
I 


P(X = 1, X2 = 0) = P — P(X, =1, X2-— 1) 


= PX = 1)P Ay = 1) = Pea = 1)(1 - P(X: = 1) 


P(X, =0, X420) 2 P 


D» 


( 
( 
( 
( 
P(X, —0, X5 — 1) = P( 
( 
( 
( 
( 
( 


( 
2 
- P(X, =0, X= 1) 
( 
2 
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Ex. p. 44 Solution 6.9 (Calculating the correlation coefficient from a joint density) 


a) For 21,22 € [1,0o), we have that 


T2 Ti r2 X1 —(0--2) 
F(z1, £2) = n f f(a, 22) dz dz» = 0(0 + nf i (z + Z2 — 1) dz dz» 


01) 8 -(041)]^! 


£2 » 1 
-f (21 + 22 — ] we» — £5 y dz = ern (zi + 22 — 1r? = T 


punc) Sa," egt) Ge teal) tu ewe 


b) The marginal distribution functions are given by 


Fi (21) = lim, F(x1, £2) = ,Um (oi + x9 D zo’ te? +1)=1- z7’, TE 


: ; EP Ac M ~0 
F (x2) = Lm. F(z1, £2) = lim (a + x2 — 1) Xj — 253 +1)=1- z, x2 € 


c) For (X1, X3) ~ F, we have that 


E(X 1X2) = 0(0 + 1) f | z1z2(z1 + £2 — (ye) dx, dro 


z—rz1-4z:2—1 


oo 1 = 1 oo 
= 0(0 T 1) f X» E + That 0m dz» 
1 - 


= 99» f za f (z — z2 + 1)2- 0*2 dz dzo 
1 £2 


—(0 + 1) Z=£2 
= E XP l1 _ọ ce dd 
= 0(0 » | va( G05 + O41 ^? day 


EU ( 6+ 1)x "m + 0(—22 + 1)23°) dz» 
1 


oo 1 0 oo 
-6 4 gaz? |g —042 ESI 
d (ss TS ) m lay Tes "bus 


E 0 — @-—0-—1 
73 0—1 (0—2)0-—1) 


if 0 > 2. Furthermore, in terms of the density fi(ri) = or TS zı > 1, of F}, 


meyk — [> kon (D - k-6—1 k—0 ER. 
xb = f zj0x, rdai =o f 0x1 rds, = 0[.— "d = =e 


= ——, k 1,2 
gp ^e) 


180 


Basic 


if 0 > 2, and so E(X?) = E(X2) = 0/(0 — 2) and E(X1) = E(X5) = 0/(0 — 1). Therefore, 


p= aXXa E cov( X1, X3) E E( X1 X3) = EX1 EX» 
vvar(Xi)var(X3) — J/(E(X?) — (EX1)?)(E(X3) — (EX2)*) 
682—0—1 ( 0 y (02—0—1)(0—1) 62 (02—0—1)(0—1)—0?(0—2) 
_ (9-2)(0-1) 0—1 u (9—2)(0—1) 0-1 — (6—2)(0—1) 
GSP o — DN Uu 
| (0-0-1(0—1)-0(0—2 6 —0?—0—0-0-1—09 420? 1 
6(0 — 1)? — 02(0 — 2) B 03 — 20? + 0 — 83 + 20? (0 


Note that we needed 0 » 2 for these results to hold. 


Solution 6.10 (Cholesky decomposition) Ex. p. 44 
Assume X to be positive definite, so that z/Yz > 0 for all a € R?\{0}. In particular, o1 > 0 
(otherwise, take x = (1,0) to see that o? = xXx = 0, a contradiction). Let 


[ue 0 ,_ fa O\fa Y ag ab 
sa (c) ome awohe Ns )=(2 pta) 


Since AA’ = X, we must have a = o4 > 0 and ab = pojo2, so b = pos, and we must have 
+c? = 03, so c = "LE -b = "TE — p?a2 = a34/1 — p?. Hence, for a positive definite X, A 
is uniquely given by 


A= O1 0 
poz oyl- p?]' 
Solution 6.11 (Density of multivariate normal) Ex. p. 44 
a) The density of Z is 


g 1 1, 
fz(z) = Ie? T Gym ew 3l z € R$. 


b) Note that rank A = d implies that X = AA’ is positive definite and thus invertible. By the 
standard method of density transformation, 


Ix (e) = frcz)(®) = fz Tæ) dt (T7 (2)) | 


With T(z) = p + Az, we have T~1(x) = At (æ — ps) and AL T-1(a) = A-1. Therefore, 


fx (x) = aE exp( uc E p) (A^! (x EN p)))Idet(A7!)| 
= Gain exp( ja p) (ATIY A7! (a — n) | det(A~!)|. 
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Smce(A- Ty A^ esCA Hp = (AA tS] and 
det(A 1) = 1/ det(A) = 1/,/det(A) det(A’) = 1/Vdet © 


we obtain that 


xl) = Soares OP (3 WE@-p)), wer 


Ex. p. 46 Solution 6.12 (Conditional distribution of bivariate normal) 


2 
o O10: H arp H . . . . 
As = i P21" \ is positive definite, it is invertible, and we have 

p9102 O3 ? ’ 


5-1 1 as —p0102]| . 1 c2 —00102 
det X | —00102 c2 (0102)? — (poi02)? V —02102 o? 


= 1 | o pL 1 ( 1/01 a 


(a102)? (1 - p?) \-poie2 o? 1—p? \-p/(o192) — l/ej 


This implies that 


(æ — p£ (æ — p) 


1/0? =p/ (0102) | [71 — 
(x1 — H1, T2 — 2) s "Mod ( a 


_ T2 — u2 
1- 
M — 
((a1 — 11)/03 — p(z2 — u2)/o102, —p(1 — p1)/o102 + (a2 — u2)/o3) E. f a 
E (454 la 209-8 mm (mm y _ 32 - 2pi is + gà 
i 1— p? 1- , 
where 2; = (rj — uj)/oj, j € {1,2}. By Exercise 6.11, 
fx s (11,22) = — exp ( - 5 - YET Ge - a) 
T1, T2) = ex z z— 
Tie dee eomm TASSE T n 
1 i3 935.432 
= exp( zi Ed + 2), 11,212 € R. 
270102\/1 — p? 2(1 — p?) 


Furthermore, 


1 1/24 2 1 -2 
w= ex = exp(—Z7/2), 21 ER. 
fx, ( 1) E »( ;( Jono, p( i/ ) 1 


01 
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We thus have that 


f a fxixs(mi 22) _ V2nai exp( £i — 2p%1 + FB uy 
Spes fx, (21) 2101021 — p? 2(1 — p?) 2 


1 p?zi — 2pii3o + 2) 


1 
ex 
V2ro2y 1 — p? Zi 2 1—p? 


tee) 
2102/1 — p? E MED 1— p? 


- 1 ew( < (u2 + p32 (x1 E 
02 V naa /1 — p 2 ele 


which equals the density of a N(u» + p22 (x1 — p1), o3(1 — p?)) distribution as claimed. 


Solution 6.13 (Covariance matrix of a mixture distribution) Ex. p. 46 


a) Assuming it exists, the (i, j)th entry cov(X;, X5) of cov(X) for i Z j equals 


cov(f (W, Zi), f(W, Z;)) 

= E(f(W, Zi)f(W, Z;)) - ECf(W, Zi) EG(W, Z;)) 

= EEL, Zi) f(W, Z;) | W)) - EF, Zi))EG'(W, Z5) 

= E(E( f(W, Zi) f(W, Z;) |W) - ECF(W, Zi) | W)ECF(W, Z;) | W)) 
(OV, Zi) | W)ECFQ, Z;) | W)) - ECQW, ZEC (W, Z5) 


) 
(W, Zi) |W)) — E(ECFOW, Zi) | W))EQECF(W, Z5) | W)) 
= E(cov(f(W, Zi), f(W, Zi) | W)) + cov(ECF(W, Zi) | W), ECf(W, Z;) |W) 


and thus, in matrix notation, cov(.X) = E(cov(.X | W)) + cov(E(X | W)). 
cov(.X |W) = W implies that E(cov(.X | W)) = E(W») = E(W)X. Furthermore, with 
E(X |W) = u+ Wy we obtain that 


2 


cov(E(X | W)) = cov(u+ Wy) = cov(W7) = (cov(W yi, Wi) Jij 
= (E(W4 Wq) — E(W4Y)E(W))i j = (EW? ig - EW) iY) 
= (var(W)yi i)i j = var(W)yy. 


Thus, by a), 


cov( X) = E(cov( X | W)) + cov(E(X | W)) 2 E(W)X + var(W)¥7’; 


compare with MFE (2015, equation (6.28)). 
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Solution 6.14 (Normal variance mixture with discrete mixing variable) 


a) We have 
X1 0 I -2 Zı 
E aaa 


which is of the required form u + VW AZ for p = (0,a), A = (3 ij and Z = (Zi, Z2), 
where W > 0 and Z ~ N2(0, I2) are independent. We thus have that X ~ Mo(p, X, Fw) 
for X = AA’ = ($1). 

b) We know that for a normal variance mixture with E(W) < oo, cov(X) = E(W)£; see MFE 
(2015, equation (6.19)). Since E(W) = 1P(W = 1) + AP(W 24) = p + 4(1 — p) 24—3p— 
4 — 2 = 2, we have that cov(.X) = 23: = p) 

c) Xı +X = 1'X where 1 = (1,1). Therefore, X; + X» ~ Mj(1'p, 1/D1, Fw) = Mı (mı + 
u2, 3331 4- 29219 + Noo, Fw) = Mi(a, 10, Fw) and so, in distribution, Xj + Xo = a+ VW V10Z 
for Z ~ N(0, 1) independently of W. By conditioning on W, we obtain that the distribution 
function of X4 + X» is given by 


Fy, 4Xx_(x) = P(X1 + X2 € x) = P(a + V10WZ < x) = p(z aa ) 


(rs Taw |”)) "rrr 


E o(—* Pw - 1) +0(—* Jew = 4) 


(a 


Solution 6.15 (Normal variance mixture with Pareto Type | mixing variable) 
a) We have 


L2] 


which is of the required form u + VW AZ for u = (0,0), A = (i 4) and Z = (Zi, Z2), 
where W > 0 and Z ~ N2(0, I2) independently of W. We thus have that X ~ Mo(p, X, Fw) 
for X = AA’ = (39) = 2h. 

b) We know that for a normal variance mixture with E(W) < oo, cov(X) = E(W)X; see MFE 
(2015, equation (6.19)). Since 0 > 1, E(W) < oo and E(W) is given by 


ce 
1 6-1’ 


0-1, 


ww). | EIL ee l 2-041 
iw) = | wfw(w)dw = 0 | w dw = 8|; 1" | 
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where fw denotes the density of W (fw(w) = 60w- (9*9, w € (1, 00)). We thus have that 
cov( X) = 742 = gr I». In particular, 


corr(.X4 X2) = cov (Xa, Xa) = E — h 
? v/var( X1) var(X2) rM ; 


so Xi, X» are uncorrelated (which can also be inferred directly from the diagonal form of 
cov( X)). 


Solution 6.16 (Diversification and the single-factor model) 


a) 


= 


Let X = (Xj,...,Xq) and Z = (Zo,..., Zq). The independence of Zo, ..., Z4 yields 


var(X) = p var(Zo) + (1 — p?) var(Zj) = p^o? + (1.— p?)o? = o? 


cov(Xi, X5) = cov (pZo + 4/1 — p?Zj, pZo+ y1- p?Z; 
= E((pZo + J1— p?Zi)(pZo-- y1- p?Zj)) - 0 


= E(P Z?) + E(pZov/1 — p?Z;) + E(pZoy/1 — PZ) + E((1 — p^)ZiZ;) 
e Z) = p0?, qoe bec dlche. 


Aci! 


Hence cov( X) = X, where X = o?P for a correlation matrix P with off-diagonal entries p°. 


Since Z ~ Nq11(0,0?14,1) and X = AZ for A € Rex (4+1) (with first column containing p 
and the remaining columns being y1 — p?Iz), we know that X is multivariate normal with 
mean vector AO = 0 and covariance matrix X as computed in a), so X ~ N4(0, X). 

Since X1 ~ N(0,c?) (as a linear combination), VaRa(X1) = ec-!(a). Furthermore, 
Xa — X X with A= (1/d,...,1/d), so Xa N(A'0, EA), where 


N 


NDA = (1/d)'o? P(1/d) = EU = 7 a(t + p*(d—1))= =a + p*(d—1)). 
We thus have that VaRa(Xa) = o/(1 + p?(d — 1))/d&-! (a). 


Note. Note that X1 = Xi, so we also obtain VaRa(X1) = o9-!(o) from VaRa(Xa) for 
dT. 


The previous calculations yield 


i d d ' 


Xung oret) M30). : MEINEM cu py coit 
VaRa(X1) oð- (a) = = Ae 


which is decreasing in d. Hence increasing d decreases the portfolio risk and thus investing 


1/d in each of X1,...,Xq decreases VaR, compared to investing the whole capital in X4. 


However, VaR4(X4)/ VaRa( X1) converges to |p| = p for d + oo. For already large d, a 
further increase of d will thus not change VaR4(.X4) by much and thus the diversification 
effect cannot be significantly improved. 
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Solution 6.17 (VaR of the sum of jointly normal variables) 
Recall that for L ~ N(u, 0°), VaRo(L) = u + o^! (o). 


2 
a) Let (X1, X2) ~ No(p, E) with E = (75. 0777). Then Xi eX» = 1 (2) ~ Nau, VEL) = 


poi02 o$ 


N(u1+p2, 02 -2p0102--02) and thus VaRg(X 1+ X3) = mı ua od + 2p0102 + 029 ! (a). 
b) Xi: X42 UX ~ N('p, 1X1) = N(©2 uj, 32-2 Diz). Therefore, VaRs (X1 + 
co Xa) =U pt VISIO (a) = 354 uj + y Eydt (a). 


Solution 6.18 (Contrasting models for uncorrelated normal variables) 


a) Since Y; = X1, the first margins are equal (standard normal). We have left to show that 
Yo = V X, is standard normal (as Xə is). To this end note that Fy,(r) = P(Yo € x) = 
P(VX; € x) =E(P(VX € z|V)) = PF(-Xi € x)/24+ P(X1 € z)/2 = (1 — 9(—2))/2 + 
$(7)/2 = ®(x)/2+ 6(z)/2 = P(x), x € R, so Yo ~ N(0, 1). 

b) Since (X1, X2) ~ N2(0, I2), cov(.X1, X2) = 0 and thus X4, X» are uncorrelated. For (Yi, Y), 
we have that cov(Y;, Y2) = E(Y1Y3) - E(Yj)E(Y3) = E(V X?) —0 = E(V)E(X?) = E(V) = 0, 
so Yi, Ys are also uncorrelated. 

c) Since (X1, X2) ~ N2(0, I2), we know that X4 + X2 ~ N(0,1 +1) = N(0,2), so Fx x, (x) = 
$(z/V2), x € R. Furthermore, Y; + Y = X1(1+V) which equals 0 with probability 1/2 
and 2X, with probability 1/2, so Fy,+y (£) = P(X1(14- V) € x) = E(P(X1(14- V) < 
z|V)) = Ií»0)/2 + $(7/2)/2 = (1150) + $(2/2))/2. 

Note. The left-hand side of Figure 8.6.1 displays Fx,4 x, and Fy, +yz. 


d) It follows from c) that VaR4(X1 + X3) = v26-1(o), a € (0,1). Concerning VaR4(Y; + Y2), 
note that Fy,+y,(0—) = 1/4 and Fy,+y,(0) = 3/4, so VaRa (Y1 +Y2) = 0 for alla € (1/4,3/4]. 
For a € (0,1/4], inverting x ++ ®(a/2)/2 leads to VaR4(Yi + Ya) = 29-7!(20) and for 
a € (3/4,1), inverting x — (1 + $(z/2))/2 leads to VaR4(Yi + Y) = 207! (2a — 1). 
Therefore 


20-1(20), a € (0, 1/4], 
VaRa(Yı + Y2) = 40, a € (1/4,3/4], 
2@-!(2a—-1), o € (3/4,1); 


note that a — VaRa(Yı + Y2) is continuous on (0, 1). 
Note. The right-hand side of Figure 8.6.1 displays VaR4 (X4 + X2) and VaR4(Y; + Y). 

A numerical computation leads to VaRo.9(X1 + X2) ~ 1.8124, VaRo,9(Y; + Y2) = 1.6832 
and VaRo.99(X1 + X2) = 3.2900, VaRo.99(Yi + Y) = 4.1075. In particular, for the two 
different distributions with uncorrelated normal margins, we have, due to the different tail 
behaviors of theirs sums, VaRo.9(X1 + X2) > VaRo(Yi + Y2) but VaRog9(X1 + X2) < 
VaRo.99(¥i + Y2). 
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Figure S.6.1 Distribution functions Fx, +x, and Fy, y; (left) and corresponding VaRa(Xı+X2) 
and VaRa(Yı + Y2) (right) for (X1, X2) ~ N2(0, 72) and (Y1, Yo) = (X1, V X1) 
where V is independent of X; with P(V = —1) = P(V = 1) = 1/2. 


Solution 6.19 (VaR-minimizing portfolio for a normal variance mixture) 

a) For Y ~ N4(0,X), we know that a/Y ~ N(a/0,a/Xa) = N(0,a/Xa), a € R0). A 
stochastic representation of a/Y is thus V a/XaZ for Z ~ N(0, 1). 

b) By a), 


3 3 3 
aX = VWa'Y 5 VWV a Yaz = A T YW VUEZ 5| 7 V WOY = y T tX, 


SO Cab = V a/3Ma/b'b. 


c) With b = (1,0,...,0) € Rf and thus b'X = Xj, b) implies that a’ X = co 4b X = ca pX1. 


By positive-homogeneity of VaR,, 


"3 
VaRa(a’X) = VaRa(ca,pX1) = Cap VaRa (X1) = = S Vena). 
11 


d) Note that a € (1/2, 1) implies that VaR4(.X1) > 0. By c) we thus only need to minimize 
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the quadratic form a/Xia. Since ag = 1 — a1, we have that 


a 
a/3a = (a3X31 + (1 — a1) Ea, a1 212 + (1 — a1) B22) [ 2 


= a2Y1 + a(l — a )%21 + a(l — a1)312 + (1 — a1) E2 
a2331 + 2a1(1 — a1)921 + (1 — a1)’ E22 
2a? Paya) + (1 — a1}? = 2a? — ay +1 = 2( (a1 — 1/4)? + 7/16), 


which attains its global minimum at a; = 1/4 with value 7/8. Therefore, the VaRa- 
E urs weights of the form (a4, 1 — a1) for a4 € [0,1] are (1/4, 3/4) with value 


VaRa( = /(7/8)/2 VaRa(X1) = (7/4) VaRg(X1). 


Ex. p. 47 Solution 6.20 (Linear combinations of t random variables) 


a) X ~ ty(v,p, X) is a normal variance mixture and thus has a characteristic function of the 
form 


x(t) = exp(it'u)fw(tXt/2), t € RH, 


where Fy(t) = E(exp(—tW)) denotes the Laplace-Stieltjes transform of W ~ Ig(v/2,v/2); 
see MFE (2015, equation (6.20), Example 6.7). The linear combination Ag 4- AX has 
characteristic function 


paota x (t) = E(exp(it(Ao + AX))) = exp(itAo)E(exp(i(£A)' X)) 


= exp(itAo)óx (tA) = exp(itÀo) exp(i(tA)'u) Fw ((()X((3)/2) 
= exp(itAg) exp(itA p) Fy (1234/2) = exp(it(Ag + A u)) Fw (2A'x:4/2) 


which is the characteristic function of a t(v, Ao + A'u, AXA) distribution. By uniqueness of 

characteristic functions, Aot AX ~ t(v, Ao4- A p, NEAN), so (Ao4- A X — (Ao4- A u)) / V NEA 

is standard £ distributed with v degrees of freedom. 

Ky jo (volt) vit"? 
T(v/2)27/2-1 , 

where K, is the modified Bessel function of the second kind. The characteristic function of 

Ao + P» AjXj is thus 


b) The characteristic function of a univariate t distribution is ¢x,(t) = 


d 
osx x(t) = E(exp(it(Ao + A X))) = exp(itÀo) [ [ E(exp(itA;X;)) = exp(itÀo) o%, (t) 
j=l 
se Shel Sy 
L'(v/2)27/2-1 


= exp(itAg) ( 


The answer cannot be inferred from a) because a vector of independent univariate 
Student t variables is not a special case of a multivariate Student t distribution; see also 
Exercise 6.3 b). 


188 


Advanced 


Solution 6.21 (Linear combinations of elliptical random vectors) 
Let ¢x,¢y be the characteristic functions of X,Y, respectively. We know that ¢x(t) = 
ei H(t Yt), t € R*, where v» denotes the characteristic generator of X. Therefore, 


dy (t) = E(exp(it' (BX + b))) = e" "E(exp(it' BX)) = e" "E(exp(i(B't) X)) 
= e ^a x (B't) = et be KG D nyc B't)'D(B’'t)) = eit (0+ Be) y(t! BY B't), 


which is the characteristic function of a E,(6+ Bu, BXB’, 1) distribution. By uniqueness of 
characteristic functions, Y = BX +b ~ E,(Bu+ b, BSB’, 4%). 


Solution 6.22 (Convolutions of elliptical distributions) 
Let ¢x+y denote the characteristic function of X + Y. By independence and with Y(t) = 


v(t)v(ct), 
óxav(t) = óx(t)óy (t) = et ^(t'Yit)e" Phat) = e GP yt), 


which is the characteristic function of a Eal + jt, E, V) distribution. By uniqueness of 


characteristic functions, X + Y ~ Eq(w+ A, Xi, v) for Y(t) = v(t)u(ct). 


Solution 6.23 (Fitting generalized hyperbolic distributions to equity return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 6.24 (Fitting a one-factor model to equity return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 6.25 (Principal components analysis of equity return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 
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Solution 6.26 (Equicorrelation matrix) 

a) Let Y = (X1/o1,..., Xa/oq). Since cov(Y;, Yj) = cov(Xi/oi, X5/0j) = cov(Xi, X;)/(oio;) = 
corr( X;, X;) for all i,j = 1,...,d, we have that P = corr(X) = cov(Y). P is positive 
semi-definite since 


a/ Pa = a! cov(Y)a = cov(a/Y) = var(a'Y) 20, a € R^(0). 
b) From a) we know that 1'P1 > 0, where 1 = (1,...,1) € R7. Since 


1 e An 
1'P1-(,...,1)| 
ie 
p p 1 
1 
= (1+(d—1)p,...,1+(d—1)p) | : | 2414 (d—1)p), 
1 
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we obtain that d(1 + (d — 1)p) > 0 which happens if and only if p > —1/(d — 1). 


Solution 6.27 (On the invertibility of the sample covariance matrix) 


a) Note that for d-dimensional random vectors Yj,..., Yn, 
n Y; 
DO YY; = (Yis Yn) | : | 2Y'Y*, 
i=l Y/ 


TL 


where Y = (Y],..., Y] € R?*4. Now consider Y; = X; — Xn, i € {1,...,n}. Then 
Y = X — (Jn/n)X = (In — J,/n)X which implies that 


1 = = 1 ie 1 
u = Xi = Xn Xi — Xna) = 5 f Ley 
" TAL) Á ) al "PT d 
1 1 
= AX 0. — In/n)')(Un — Jn/n) X) = —1X n — Ju /n)(4s — Ja /n) X 
EV. 1 / | 2 2 EE 1 / 
nr m Dt (I, — Js /n — In/n + J; [n*) X nont i* (In — J4/n)X. 


b) Since the sum of the second to the last row of In — Jn/n equals its negated first row, 
rank(I, — Jn/n) € n — 1. Subtracting from each of the second to the last row the first row 


of I, — Jn/n produces a block matrix of the form E 5" E d Because of the lower-right 
block Jn, rank(I5 — Jn/n) > n — 1 and thus rank(I, — J,/n) — n — 1. 
c) Using a) and 
i) rank(AB) € min{rank A, rank B} 
ii) rank A’ = rank A 
iii) rank(A) € min(m,n) if A c R™*” 


we can argue that 
rank(S,) = rank(X'(I, — Jn/n)X) € min{rank X’, rank(I, — J,/n), rank X] 
i) 


" min{rank X,n — 1) € min(min(n, d}, n — 1) = min{n — 1,d}. 
ii iii) 


In particular, if n < d, then rank(S,) is at most n — 1 < d so that S, € R^*4? is not 
invertible. 


Note. As we can see, the scaling 1/(n — 1) used for S, does not affect the results, the same 
thus applies to the scaling 1/n (the maximum likelihood estimator of the covariance matrix 
under joint normality). 


Solution 6.28 (Characterization of the multivariate normal distribution) 
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a) According to the uniqueness of characteristic functions, X 5 Y if and only if dx (t) = dy (t) 


for all t € R7. Clearly, the latter implies that ó,x(t) = E(exp(ita/X)) = ¢x(ta) = 
óy(ta) = ówy(t) for all t € R, a € R4 and thus that a/X = a’Y for all a € R4. 


The converse also holds since tX = tY for all t € Rf, so ¢x(t) = E(exp(it/X)) = 


E(exp(it’Y)) = óy (t) for all t € R? and thus X 5 Y. 


b) Let X ~ Ng(y, X) and a € R*. Then 


Qa/X (t) = 


espuma x)= dxtia) Sem (tayu = 5 (ta)'S(ta) ) 


1 
= exp (ra'u — jaa). tER, 


which is the characteristic function of a N(a'u, a'Xa) distribution. Uniqueness of charac- 
teristic functions implies that a' X ~ N(a'p, a/Xa) for all a € R4. 

For the converse, let a/ X ~ N(a'p, a/Xa) for all a € IR. Furthermore, let Y ~ Nq(p,). 
We have just seen that a'Y ~ N(a/p, a/Xa) for all a € RÍ, so that a/X 5 a’Y for all 


a € R?. By a) X 5 Y and thus X ~ N4(p, X). 


Solution 6.29 (Conditional distribution of multivariate normal) 

Consider Y = X5 + AX, for A= Sun c R(-9**. Since Y = BX for B = (A, Ia_k), 
we have Y ~ Na (Bp, BX B). By definition, Xo = Y — AX, so that Xə| X = zı 2 
Y — Azı ~ Ng_x (Bu LÍ Aa, BSB’). Since 


Bp — Axı = (A, Ia X) (5) — Az, = Ap, + p2 — Ad, = u2 — A(21 — pa) 


= pa + XX qi (zı — pi) = uaa 


and 


» » A’ A’ 
BXB' = (A, Ig.) | us 3 | = (AX; + X21, AX15 + 32) | 


331 M22] (a-k Ig 


= ADA! X31 A’ + AXi + X25 
= Ya hy En (Dah) — Xa053Xg]'- Ean E 4X2 


= Xa (Xa hy) — Xi (Xi yl — Eg X plo + X2 


E 
= X22 — 39313111 %12 = X221 


the claim follows. 


2 
Note. For d= 2, k = 1 and X = ( 71 se with 71,02 > 0 and p € (—1, 1), we obtain that 


p9i02 9% 


p91902 


X3| Xy = z1 ~ N(ua + Se (21 — ui); o3 — EN. P0192) = N(us + pg (ai — un); o3(1— p?)) 
which is what we already saw in Exercise 6.12. 
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Ex. p. 50 Solution 6.30 (Density of multivariate t) 
Let D? = (æ — u)'Xi (zx — p). 
a) Note that rank A = d implies that X = AA’ is positive definite and thus invertible. Since 
X —n-- VW AZ, we have X |W ~ Nalu, WE) and thus 


Fre) = [^ cca ae (ca (6 7 1 89) — p) ) Far) dw 


oo 1 D? 
ü | (21)4/2 / ud det © Yum (=ar o a 


where fw(w) = (v/2)"/2w-"/?-! exp(—v/(2w))/T(v/2), w > 0, denotes the density of an 
Ig(v/2, v/2) distribution. With z = 1/w (so dz = —w~? dw and thus dw = —z~? dz), we 
obtain that 


(v/2)"? JE wu (*2/2-1 o( C Pa 
0 


K@= Toy Jo Quy dE 2w 
S (v/2)*2/? T E E (- v + = fs 
L'(v/2)(rv)2/24/ det © Jo 2w 
= CAA M gut) /2-1 exp( =z" = 2^) dz. 
z=1/w T(v/2)(nv)d? / det Y Jo 2 


For a = (v + d)/2 and 8 = (v + D?)/2 the last integral equals 


T 2°-1 exp(—z8) dz = a) ru à z*-lexp(—zf) dz = ate) 
0 B* Jo 


Tr(a) Bo 
S r(( + d)/2)  T((v 4 d)/2) (1+ 2 
(v/2) 920 + D2/y)-9/2 — (yJ2) 97 - 
and thus 
u I'((v + d)/2) (a — u)X-l(x— u) — ed P 
Pes T(v/2)(nv)4/2 det X; (1 » ) , «&ER"’. 


b) For any æ € Rf and v — oo, we have that 


= T((v + d)/2) D? p2\ -4/2 
ide T(v/2)(v/2)9/2(2x)4/2A/det Y: ( ub ) (1 T) 

= 1 E | B (ere d 

— Qu)? /det Y: ^ v[2 2 I2) 


Since 
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we have left to show that 


By Stirling’s Formula, T(y/2)  J/2n(y/2 — 1)((y/2 — 1)/e)"/?-! for y — oo. Therefore, for 


V — oo, 
(ure D E Va FDZ - (a d)/2- 1/9) * ^ 
2 T(v/2) 2n(v/2 — 1)((v/2 — 1)/e)"?-1((v + D?)/2)4/? 
tium T) (UF DSI yeu 
B v/2—1 ( v/2—1 ) (aros) 


€ d/2 
E aee 3 a) 
B vy—2 v —2 e(v 4- D?) 


| Ac (d-2)/v dj2 N59 (14 (d 2)/v 1? 
2 1—2/v (1+ o as) ( rR) exp(—d/2) 


which converges to 1 since the first and the third factors do and the second converges to 
exp(d/2). 


Solution 6.31 (Conditional distribution of bivariate t) 
As in Exercise 6.12, det © = (o102)?(1 — p?) and 


~2 ~ z ~2 
£ Ti- 20% Fo FT 
(a m p) ‘(a m = ! 1— p? A 


where &; = (rj — nj)/oj, j € {1,2}. By Exercise 6.30, 


_ I((v* 2)/2) (@ — uyE-(r-pg)NV 3 

PRU Fio erg Ü 
T((v + 2)/2) (1 Z — 2p% + č 

T(v/2)rvoyoa/1 — p2N — (1 — p2)v 


BEER 
) , 21,929 € R. 


Furthermore, 


— I(v41)2) (, CEPTO r(v41)/2) 7, , 8b 58 
werd n ) Se v 
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We thus have that 


Px) xii (£2) = Pete 


v+2 
Ži —2p%1 Lotz E 
e+) (C+) 


r(( + 1)/2)Vrvo v1- p? G 


-2 MCI 
T'(((v 4- 1) 4 1)/2) [em j 
z 2 
T((v +1) )/2) ves T — Py + Ë EE 
L2 (v+1)+1 
r((w +1) + 1)/2) (5 aee 5 
(w+) + 2630 2) vtži TES 
r((w +1) )+1) )/2) ( Te A 32-592). -CPH 
(Gr 0/2) nv o3. — 22d Vd 
(v+1)+1 
A2) \ 
L TG 1/2) ET 
= e 
((v 4- 1)/ "me merci 
MESES 
(22—(u2-- 22 (31 —11))? 2 


r((w +1) 4 1)/2) o3(1—p jd 


» %2 x v+1 
(Gr 0/2) «(7 o3. — A 


distribution 


which equals the density of a (v+1, ua + p22 (a1 — m), o2(1— p?) Fn O 


as claimed. 

Note. By Exercise 6.30 b), the (multivariate) Student ¢ distribution converges to the (multi- 
variate) normal distribution for v — oo. This implies that the conditional distributions also 
converge. As a result, the location and scale parameters of the conditional t have to converge 
to the mean and variance of the conditional normal distribution. This indeed holds since the 
location parameter u2 + 02 (m — p11) of the conditional £ equals the mean of the conditional 


normal as given in Exercise 6.12 and the scale parameter c2(1 p?) ctm a)? of the 


conditional ¢ converges to the variance c2(1 — o?) of the conditional normal for v — oo. 


Solution 6.32 (An interpretation of the principal components transform) 
a) We already known that Y; = y, X, so have left to show that 


var(y,.X) 2 max  var(a'X). 
a€R4, |a||21 
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Let us focus on the right-hand side. Since var(a/ X) = a'/Xa, we can consider the Lagrange 
function 


L(a,n) 2 a/'Ya—5(a/'a—1), ac Ri, 


with Lagrange multiplier 7. Then 


VL(a,n) = ocu = RO 2) 


which equals 0 if and only if 
Xa 5a and da=l. 


We thus see that the maximal var(a/ X) is necessarily attained for a being a standardized 
eigenvector of © (an eigenvector with unit norm), so one of Ņ1,..., Ya. To see that it is 
indeed 771, suppose a is a standardized eigenvector and the eigenvalues of X are sorted in 
decreasing order Ay > --- > Aq 2 0. Then, for all j € {1,...,d}, 


var(a' X) = a/Xia = Aja'a = Aj (S1) 


which is maximal for j = 1, so the standardized eigenvector a which maximizes var(a/ X) is 
yı- 
Note. With maximization replaced by minimization, the same argument applies to show 
that yq (the standardized eigenvector a corresponding to the smallest eigenvalue Aq of X) 
minimizes var(a' X). 
We already known that Yo = 45 X, so have left to show that 

var(45X) — max  var(a'X). 


a€R4, |[a||-1, 
a'yı=0 


Focus on the right-hand side and consider the Lagrange function 
L(a, ni, n2) = a/Ya — m(a'a e 1) mayı, ae R4, 


with Lagrange multipliers ņı and ro. Then 


25a — 2noIga — yı 2(Xa — Na — 01/2) 
VL(a, m,n) = —a'yı = —a^ j 
—(a'a — 1) —(a'a — 1) 


which equals 0 if and only if 


Ya—ma-—my/2=0 and ayı =0, a’a=1. 
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Multiplying the first equation by yi from the left, using the two latter equations and the 
fact that X^; = A171 (since *1 is an eigenvector of X with eigenvalue A1), we obtain that 


0 = 41Xa — mya — mom /2 = (m)'a - m /2 = Ana — m/2 = —m/2, 
so 7, = 0 and thus 
Ya-—ma=0 and a'yı =0, a'a- 1. 


We thus see that the maximal var(a/ X) is necessarily attained for a being a standardized 

eigenvector of 3 orthogonal to 771, so one of 7ya,..., Yq. That it is indeed y2 can be verified 

as in a), see (S1). 
Note. One can proceed similarly to see that the jth principal component Y; of X is the 
standardized linear combination of X with maximal variance among all linear combinations 
orthogonal to the linear combinations of the first 7 — 1 principal components. Note that we 
already know that the jth principal component Y; equals VX with var(Y;) = Aj. We have now 
learned from this exercise that this standardized linear combination of X maximizes var(Y;) 
under all standardized vectors in R? which are orthogonal to all previous such vectors. 
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Review 


Solution 7.1 (The concept of a copula) 
A copula is a multivariate distribution function with standard uniform univariate margins. 


Note. This is a probabilistic definition, based on the concept of distribution functions. An 
(equivalent) analytical definition is the following; see MFE (2015, Section 7.1.1). A d-dimensional 
copula is a function C : [0, 1] — [0,1] which satisfies the following three properties: 


1) C is grounded, that is C(u) = 0 if uj = 0 for at least one j € {1,...,d}; 

2) C has standard uniform univariate margins, that is C(u) = uj if u = (1,...,1,uj, 1,..., 1) 
for any j € {1,...,d}, u; € [0, 1]; 

3) C is d-increasing, that is the C-volume 


d d . : , ; 
Aay = F (7125 C(ai l7, ... aldol“) 
ic (0,1)2 


satisfies A(g. 5C > 0 for all a, b € [0,1]? with a < b. 


Solution 7.2 (The importance of Sklar's Theorem) 

The first part of Sklar’s Theorem states that any d-dimensional distribution function F with 
univariate margins F1,..., Fy can be decomposed via F(a) = C(Fi(z1),..., Falza)), £ € R4, 
where C is a copula. Since C combines (or ‘couples’, hence the name) the margins F),..., Fy 
to F, it is precisely the function C that captures the dependence between random variables 
Xı ~ Fy]412, X4 ~ Fy; this allows one to study multivariate dependence independently 
of the margins. If Fj,..., F4 are all continuous, then C is uniquely defined and given by 
C(u) = F(Ff-(ui),..., Ff (ua)), u € [0,1]7. In this case, statistical applications often exploit 
the decomposition F(x) = C(Fi(zi),..., Fa(za)), for example for estimating F from available 
data by separately estimating Pj,..., Fy and C. As an example, in case the densities exist, 
if d — 100, each of the margins has two parameters and C has a single parameter to be 
estimated, then a maximum likelihood estimator for F would require to solve a 201-dimensional 
optimization problem. In contrast, treating the estimation of the margins and of C separately 
requires to solve 100 bivariate optimization problems and one univariate optimization problem; 
the former can often be done in parallel (if the estimation of the copula parameter is based on 


pseudo-observations, all 101 low-dimensional optimization problems can be solved in parallel). 


'This is often both faster and numerically more robust. 
The second part of Sklar’s Theorem states that any function F of the form F(a) = 
C(Fi(z1),..., Fa(zq)), £ € RH, for a copula C and marginal distribution functions Fj,..., Fy 
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is a d-dimensional distribution function. This allows one to construct tailor-made multivariate 
distribution functions and is often used for model building, sampling and stress testing. For 
example, if various types of losses are observed in different business lines but not all simultane- 
ously, one often has enough data to estimate each of F1,..., Fa but not their joint distribution 
function F. In stress tests, copulas can then be used to formulate dependence scenarios and 
their effects can be investigated by means of, say, simulation. 


Solution 7.3 (On the importance of the three defining properties of copulas) 

Since 0 € W(u) — mak E 4 jk uj— d+ 1,0} < max{(d — 1) — d + 1,0} = 03 up = 0 
for at least one k € {1,...,d}, W is grounded. W also has U(0,1) univariate margins since 
W(u) = max{ (u; +d-— 1)-—d+1,0} = uj if u = (1,...,1,uj,1,...,1) for any j € {1,...,d}, 
uj € [0,1]. Clearly, W is also increasing in each component uj. However, for d > 3, W is not 
d-increasing and thus not a copula; see MFE (2015, Example 7.24) where it is shown that the 
W-volume A(1/2,1}W = 1 — d/2 which is smaller than 0 for d > 3. 


Solution 7.4 (Special cases of normal variance mixture copulas) 

A d-dimensional normal variance mixture copula is the copula of X = /W AZ, where W is a 
non-negative random variable, AA' — P for P being the correlation matrix with off-diagonal 
entries all equal to p € [-1, 1] and Z ~ N4(0, I4) being independent of W. 


a) If p — 1, A being a matrix of 1s in the first column and 0s otherwise satisfies AA' — P. In 
this case, X = VWAZ = VW(Zi,..., Z1), so the components of X are comontone and 
thus the copula C of X is the comonotonicity copula M. 


Note. For Gauss copulas, one can also argue differently. Since X ~ N4(0, P) with corr(X) = 
P and p = 1 corresponds to each of X»,..., X4 being a linearly increasing function of 
X1, one knows that X4,..., X4 are comonotone. However, for t copulas, for example, this 
cannot be done in the same fashion in general since for X ~ ty(v,0, P), corr( X) = P only 
holds if v > 2 (so if the covariance matrix of X exists). 

b) If p = 0, we can take A = I4 and so X = VWAZ = VWZ = (VWZ,...,VWZqg). E W 
is constant almost surely, in which case the normal variance mixture copula is a Gauss 
copula, then p = 0 implies that the components of X are independent and thus that the 
copula C of X is the independence copula II. In general, this argument does not hold 
anymore if W is non-degenerate and p — 0 will not lead to the independence copula; see 
also Exercise 6.3 b). 


c) Ifp=-1,A= (4 8) satisfies AA’ = P. In this case, X = /WAZ = /W(Zi,—Z41), so 
the components of X are countermontone and thus the copula C of X is the countermono- 


tonicity copula W. 


d) p= —1 is not a valid parameter anymore for d > 3 since p must satisfy p > —1/(d — 1) for 
all d > 2 in order for P to be a proper correlation matrix with off-diagonal entries p; see 
Exercise 6.26 b). 
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Solution 7.5 (Correlation and rank correlation) 
1) p does not exist for all pairs of random variables (finite second moments are required). 
2) p depends on the marginal distribution functions of the underlying random variables. 


3) p is only invariant under strictly increasing linear transformations on the ranges of the 
underlying random variables (but not under strictly increasing transformations on the 
ranges of the underlying random variables in general). 


Solution 7.6 (Maximal VaR of the sum) 
a) This is true, as proved in MFE (2015, Theorem 7.28). 


b) This is true since var(X1 + X2) = var( X1) + var(X2) + 2cov(.X1, X2), so the variance of 
the sum becomes maximal if the covariance is maximal, thus if the correlation p(.X1, X2) is 
maximal. 


c) This is false in general. Value-at-risk is comonotone additive, see MFE (2015, Proposi- 
tion 7.20), thus subadditive under the comonotonicity copula M. Since M implies maximal 
linear correlation, value-at-risk is subadditive under maximal linear correlation. However, 
VaRo(X1 + X2) can very well be larger (and thus superadditive) under a copula different 
from M (so maximal correlation does not in general imply maximal value-at-risk of the 
sum); see, for example, Exercise 8.21. 


Note. See Exercise 8.5 for a similar exercise but for expected shortfall. 


Solution 7.7 (VaR for the sum of two normal risks) 

We know that specifying the marginal distributions (both normal here) and the correlation 
coefficient p does not uniquely determine a multivariate model; see MFE (2015, Fallacy 1 and 
Example 7.26). Hence, any joint distribution with normal margins and correlation p provides a 
distribution for X4 + X» and thus a (typically) different value for VaR4(.X1 + X2). Unless such 
a joint distribution is uniquely specified, VAR4 (X4 + X2) is not uniquely determined in general. 


Note. In Exercise 6.1, only the (normal) margins are specified, above we specify the (normal) 
margins and p. Either way, we know by Sklar's Theorem that only specifying a whole function 
(namely the copula) in addition to the margins uniquely determines a joint distribution if the 
margins are continuous. 

This was done in Exercise 6.17 a), where we specified the joint distribution of X1 and 
X», that is the distribution of (X1, X2). In this case VaRa(Xı + X2) is uniquely determined 
as both margins and copula are specified. For example if (X1, X5) are jointly normal, then 
VaRa4 (X4 + X3) is known explicitly as X + X» is (univariate) normal in this case; see MFE 
(2015, equation (6.12) and Example 2.11). 

Also note that a model based on specified margins and p does not even necessarily exist; 
see MFE (2015, Fallacy 2 and Example 7.29). This is not a problem here, as our margins are 
normal and thus there is always a joint normal distribution with these specified margins and 
correlation p. 
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Solution 7.8 (Distinguishing meta-distributions from scatterplots) 
a) The top-left plot. 

b) The middle-left plot. 

c) The top-middle plot. 

d) The bottom-right plot. 

e) The middle-right plot. 

f) The top-right plot. 

g) The middle-middle plot. 

h) The bottom-middle plot. 

i) The bottom-left plot. 


Solution 7.9 (Estimation of a copula) 
In practice we typically observe data which we view as realizations of .X4,..., Xn ^» F (x) = 
C(Fi(zi),..., Fa(xaq)), where Fi,..., Fy are the margins (assumed to be continuous here) of F 


and C the copula of F. In a non-parametric setting, one typically considers as observations 
from C realizations of U,...,U, for U; = (Ui,...,Uijq), à € {1,...,n}, where 


n Lx Rij 
Ug Oe) a Do ea Ss 
k=1 


and Rj; is the rank of X;; among X1;,...,Xnj. The constructed Uj,...,U, (or their realiza- 
tions) are known as pseudo-observations and, as said, are considered as observations from C. 


Note. Note that the scaling factor n + 1 (rather than n) is used to guarantee that none of the 
Uj;’s is 1 which would lead to problems such as density evaluation for maximum likelihood 
estimation of copula parameters, for example. 


Basic 


Solution 7.10 (Extracting the copula from a joint distribution) 
a) For j € {1,...,d}, the jth margin F; of F is given by 


d 1/0 
F;(2;) = ,Iim. F(x) = , im. ex (-( a ) 
kE{1,.. d} \{J} kE{1,.. d} \{J} =l 


j 
= exp(—(0+---+0+05°+0+---+0)'/%) = exp(-1/z;), æj € (0,00), 


which is the distribution function of a unit Fréchet distribution. 
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b) The quantile function F7 corresponding to Fj is Ff (u) = 1/(—logu), u € (0,1). By 


Sklar's Theorem, the copula C of F is thus given by 


C(u) = F(Ff (ui),..., Fy (ua)) )) = ew(- (San — log uj)) r^) 


(-logujf) A u € [0, 1]7, 


which is a Gumbel copula generated by Y(t) = exp(—t!/®). 
c) The probability is P(X; > Fí-(0.95), Xa > Fj-(0.95)) = P(Fi(Xi) > 0.95, Fo(Xo) > 
0.95) = P(U, > 0.95,U2 > 0.95) = 1 — 0.95 — 0.95 + C(0.95,0.95) = 0.0300. In the 


independence case the probability would be 1 — 0.95 — 0.95 + 0.95? = 0.0025 so the ratio is 
approximately 300/25 = 12. 


d 


-es(-( X 


j=l 


Solution 7.11 (Copula of a normal random variable and its absolute value) Ex. p. 54 
For z € R, x2 > 0, the distribution function of (Z, |Z|) is 


F(zi,22) = P(Z € zi,|Z| € z2) = P(Z € zi, 2 € Z € 122) = P(—z2 € Z € min(zi, r2}) 


_ oo —~@(-22), min{x1,22} > —22, 


0, min{z1, £2} < —z», 
(x1) — 6(—23) = 6(z1) — (1 — ®(a2)), x1 < £2, x1 > —12, 
= 4 2(x2)- 1, z1 > T2, T2 2 —T2, 
0, FI = T2, 
(x1) + (x2) —], me [—22, £2], 
= 4 2®(r2) — 1, Tj T2, 
0, tı < —12, 


where ® denotes the distribution function of the N(0, 1) distribution; note that ®(—x) = 1— (x), 
x E€ R. Clearly, Fı (x1) = (x1) and Fo(x2) = limz, 400 F(z1, 2) = 2®(x2) — 1, with quantile 
functions Ff (ui) = 9-^!(uj) and F$ (u2) = 9^ !((us + 1)/2), respectively. Hence, by Sklar's 
'Theorem, 


C (ui, ua) 
$(871(u) + (Puz 2/2) -1, Mu) € aes epee s 
= 29 (9-7 (us +1)/2)) - 1, di a 
0, -1 (u1) < 9 (us + 1)/2), 
uy — (1 — u3)/2, ui € [(1 — u3)/2, (1 + u3)/2], 
= 4 Ua, uj > (u2 + 1)/2, 
0, uj < (1 — 13)/2. 
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Solution 7.12 (Minimal correlation for standard exponential random variables) 


a) 


b) 


The countermonotonicity copula W has stochastic representation (U,1 — U) for U ~ 
U[0, 1]. As the quantile function of Exp(1) is Fj(u) = —log(1 — u), j = 1,2, we ob- 
tain from (the second part of) Sklar's Theorem the stochastic representation (X1, X2) = 
(—log(1 — U), —log(U)); since 1 — U ~ U(0,1), we could have equally taken (X1, X2) = 
(—log(U), — log(1 — U)). 

By MFE (2015, Theorem 7.28), Pearson's correlation coefficient p is minimal if X1, X2 are 
countermonotonic. As they are, we know from a) that (X4, X2) = (—log(1 — U), — log(U)) 
has minimal correlation for standard exponential margins. As E(X;) — var(X;) — 1, 
j= 1,2, we obtain that the minimal correlation pmin is given by 


.. E(X1X2) - EQG)E(X3) _ EQG X2) -1 _ 
Pmin var (X1) var( X3) 1 


E(log(1 — U) log(U)) — 1. 


Using numerical integration to compute E(log(1 — U) log(U)) = J log(1 — u) log(u) du, we 
obtain Pmin ~ —0.6449. 


Solution 7.13 (Maximal correlation for Pareto Type | random variables) 

Let X; ~ Fj(v) = 1-1/2%i, x > 1, 6; > 2, j € {1,2}, with copula C. By MFE (2015, 
Theorem 7.28), p € [Pmin, Pmax] and p = pmax if and only if C = M, that is if C is the 
comonotonicity copula M. For C = M, (X1, X3) admits the stochastic representation 


(X1, X2) = (FEU), FO (U)) = (a7 U) ^, 1 — uU) 6 


for U ~ U(0, 1). Note that E((1 — U) 7) = 1/(1— r), r < 1, so that 


E(X1X3) = E((1 — U) /^(1— y) V9) = E((1 — uU) 0/9») 
= 1/(1 — (1/6; + 1/62)) = 1/(1 — 1/6; — 1/62). 


Since E(X;) = E((1— U)-/%) = 1/(1— 1/0;), j € (1,2), we obtain that 


x n 1 1 1 
cov( X1, X3) = E(X41X3) = E(X1) E(Xə2) = = 1/6; = 1/0; I 1/0; i= 1/0; 


With E(X7) = E((1 — U)-?/6;) = 1/(1 — 2/05) one has 


var(X;) = 1/(1— 2/8;) -1/0. - 1/6), j € (1,2), 


and thus 


1 1 1 
" 1—1/0,—1/@  1—1/011— 1/6 
Pmax OT — 2/81) - 1/0 — 1/63) 07 — 2/85) - 1/0 — 1/627) 
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Figure S.7.1 Maximal correlation pmin (left) and minimal correlation pmax (right) as functions 
of 01,05 > 2 for Pareto Type 1 random variables. 


Note. The left-hand side of Figure S.7.1 shows pmax as a function of 01,05 > 2. The maximal 
attainable correlation is particularly small if one of the parameters is small (close to 2) and the 
other one is comparably large. 


Note that we also included the minimal correlation pmin in Figure $.7.1, see the right-hand 
side (Pmin is not available analytically but allows for a representation in terms of a series). 


Solution 7.14 (Copulas and correlation ordering) 
By Hóffding's Lemma and Sklar's Theorem, 


cov( X1, Y1) = i jd (Cy (P 21), Fo(22)) = F (x1) Fo(22)) dai dz» 


< F i. (C2(Fi(x1), Fo(v2)) — Fi (23) Fo(22)) dz, day = cov( X», Y2), 


which, because of equal margins, implies that p(.X41, Yi) € p(X, Yo). 


Note. In particular, if (X2, Y2) ~ F with margins F}, F> with finite second moments, then 
F(xi,x39) > Fi(zxi)P5»(z2), 21,22 € R, implies that p(X2,Y2) > 0; in this case Ci is the 
independence copula and C5 the copula of F. 


Solution 7.15 (Copula-based dependence measures under simple transformations) 


a) Let X, = -X ~ F with distribution function F (£) = P(—X; € z) = P(X, > —2) 
1 — F\(—2) and let X = X3 ~ P» with distribution function F(x) = P(X3 < x) = P(X2 


IA l 
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Wx) = Fo(x/x). Then 


ps(—X1,X8) = ps, Xo) = pF (Xi), FG) = o(1 - RC C38). G/XD) 
= p(1— Fi(X1), Fo(X2)) = p(1 — U1, U2), 


where U; = F1(X1) ~ U(0,1) and U5 = Fo(X2) ~ U(0,1). Since h(x) = 1—z is a decreasing 
linear transformation, a short calculation implies that p(1 — U1, U2) = —p(Ui1,U2) = 
—p(Fi(X1), Fo(X2)) = —ps(X1, X2), so ps(—X1, X3) = —ps(X1, Xə). 
Note. We could have seen this in quicker way. Rank correlation measures capture the 
monotonicity of Xə as a function of X1. Since (X1, X2) can be transformed to (— X1, X3) by 
applying a decreasing transformation in the first component and an increasing transformation 
in the second, the monotonicity of the second component as a function of the first changes 
its sign, hence ps(— X1, X3) = —ps(X1, X2). 

b) By the invariance principle, see MFE (2015, Proposition 7.7), the copula of (U1, U2) is the 
same as the copula C of (U1, U2). Hence the coefficient of upper tail dependence of the 
copula of (U1, U2) is the same as the coefficient of upper tail dependence of C. 


Solution 7.16 (Archimedean generator properties) 

a) If limp 4. Y(t) > 0, then C cannot be grounded. To see this, consider u € [0, 1] with 
uj = 0 and note that V l(uj) = v !(0) = inf{t : y(t) = 0) = oo. Then C(u) = 
Va V 1(uj)) = v (oo) = limi Y(t) > 0, so C cannot be grounded. 

b) If ~(0) < 1, then C cannot have U(0, 1) univariate margins. To see this, consider u c [0, 1]? 
with uj = 1 for all k € (1,...,d Mj) and uj € (v(0),1] (by assumption a non-empty 
interval). Since i)! (u) = 0 for all u € (v(0),1], C(u) = Va qoi ree 15) = 
v(0) A uj for all uj € (v(0), 1], so C cannot have U(0, 1) univariate margins. 

c) Note that for c 0, w(t 
V^ !(u) = v1 (u)/e, u € [0 


)esg(dw,t:z0,1an Archimedean generator with inverse 
, 1]. The Archimedean copula C generated by ~ equals 


C(u) = db (u) +o +O (ua) = (eo (u)/e -+W (ua)/0)) 
ow Mur) +: +y (ug) = C(u), we [0, 1f". 


d) We know that C(u1, u2) = Y(Y (ui) + Yt (ua)), so substituting t = v^! (u) leads to 


m CUm quo GU TUE e THO 


u—04- u u04- u ^ o w(t) : 


Solution 7.17 (Clayton copulas) 


a) Let 0 » 0. First, y(t) = (1+ t)-V9, t > 0, is an Archimedean generator since v : [0,00) > 
(0, 1], 2 is continuous, decreasing, satisfies (0) = 1, v(oo) = lim, ss Y(t) = 0 and is strictly 
decreasing whenever positive. In order for v to generate an Archimedean copula in all 
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dimensions d > 2, 7 must be completely monotone, that is (—1)*y“)(t) > 0 for all k € No, 
€ (0,00); since w is positive, we have left to check k € N. For k € N, 


vm - (D... (b= na eerie = aeg Tt (C1 - d 
Eu ! - 57); 


l=0 


so that (—1) y (t) = (1 + £710- TT 1 (1 -- 1/0) > 0 for all t € (0,00). 
b) By Exercise 7.16 d), 


1+ 2t E 1 24 —1/0 
= dg (YE im (10802 gne 
too w(t) too\1+t t3ooN1/t 4-1 


c) Let II(u..;) = [Tí ux and A4(0) = 3574 H(u..;)? — (d — 1)I(u)?. Then 
kj 


—1/0 d, d -1/0 
-0 -íu uf \— (d u)? 
D= (Xs (4-5) =m GU £) - « - one) 
d —1/8 
v (ome)? (a= DM?) = nonu) 
2 
= 11(u) exp(- los(hu(0))/0). (S1) 


We have thus left to show that limgs94 log(hu(@))/@ = 0 for all u € (0, 1)7. To this end, 
fix u € (0, 1)7 and note that 


hi (0) = 3 E(u)’ log(IH(u..;)) — (d — 1)H(u)" log(I(u)), 


M0) M0) — Xi log(IHI(u..;)) — (d — 1)log(II(u)) 


lim log(hy(6))/0 ha(0)  hu(0) G1) 


60+ 0 


I 
E 
| 


i 
eo 
+ 


Il 
Mes 
Mea 


S 
ll 
= 
em 
+% Il 
Ge 


log(ux) —1 en (uj) 


I 
Ma 
Ma 


log(ux) —1) en uj) 


> 
ll 
= 


KS. 
YK II 
re 


d 
= Md — 1) log( (ux) — (d—1 X at (uj) 


It follows from (S1) that limg_,9; Colu) = II(u) (independence copula). 
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d) Suppose, without loss of generality, that u; € (0,1) and uj, < u; for all j € {1,...,d}\{k}. 
Furthermore, let h4(0) = Y (ug/uj)? — (d — 1)u. Then 


w= (Sua 4-4) = a(l = Duk) = aha 


j=l 
If uj = ux for all j, then hy(@) = d — (d — 1)uf which converges to d for 0 — oo and so 
Colu) > uk: d = Uk. (S2) 
If not all uj are equal to uz, we can write 


Co(u) = uy exp(— log(hu(0))/0) (S3) 


and apply l'Hópital's rule (case *-/oo") to see that 
h! (0 


jim au Hal p, Defeat) ow un fey) - (d - uf loglar) 
= S Ituj-u,) log (uz /u;) e 
"SE Ituj-u,) 


where we used that (uj/u;)! — Iju;—u,) for 0 — oo, that uf — 0 for 0 — oo and that 
for each j € (1,..., d], either Itu;=u,} = 0 or log(ur/u;) = 0; note that the denominator 
cannot be 0 since uj = uj; for j = k. It follows from ($2) and (83) that lime. ,4; Colu) = 
uk = min{ uz, ..., ua} = M(u). 


Ex. p. 55 Solution 7.18 (Outer power Archimedean copula) 


a) The inverse 7t of V is given by 9-!(u) = v-!(u)^, u € [0,1], with derivative (7t)! = 
Byt (u) Tt!) (u), u € (0,1). By MFE (2015, Proposition 7.49), 


" Ex ORE. BIOL ; 
a rod Ue. Tuy on 
01,4 fh dow) Hu) 
=14 Bly yw) du = «S du D du) 1) /4 
BE oio M, 


b) Let C denote the (outer power) Archimedean copula generated by w. Substituting t = 
v^ 1(u), the coefficient of lower tail dependence A, is given by 


dez dieh C(u, u) EE p24 (u)) 


= = 1 = lim = lim = 
u04- u u04- u u04- u u04- u 
1/8 (91/8 
— dm O — oua mg V OVID 
i—oo v(t) i—o0o v'(t) 
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where the last equality follows from an application of l'Hópital's rule (case ‘0/0’) assuming 
the limit to exist. Similarly, the coefficient of upper tail dependence A, is given by 


mee 1 — 2u + C(u, u) 


oque Lt dOMPy Mw) — o 1—29(0) tyle) 


u1— l—u ^ ubl- l—u t350+ _ w(t) 
z) 1/By) _ = 1/8 ! (91/B 
= lim 2C AND GE — 2 — lim 1— v(2 ^t) —2. 91/8 lim ee: 
per 1— y(t) t30- 1-—w(t) t0+ y(t) 


where the last equality follows from an application of l'Hópital's rule (case ‘0/0’) assuming 
the limit to exist. 

c) Using that Kendall’s tau of a Clayton copula is p, = 0/(@+ 2), a) implies that Kendall’s 
tau of the corresponding outer power Clayton copula is 


(0 +2) 9 2 


pr =1-(1-p,)/B=1- 02 IP le m 


With w(t) = —1(1- t)-?-1, b) implies that 


x (91/68 —1(4 + 91/84)-1/0-1 1/B4\ —1/0—1 
X - 2/8 tim PCY _ ove lim g( t ) me sae 1+2 J 


v" (t) too -$5 + t)-1/0-1 Bai] +t 
= aU Bro B a ae — 9-1/(88) 


and similarly 


: 1/91/84 1/84) -1/0-1 
Jee 2—2" lim call JM SUB, ng tr nd = 2 — 2"/ê, 
120- l-t 


Solution 7.19 (Farlie-Gumbel-Morgenstern copula) Ex. p. 55 
a) 1) C(u1,0) =0+0=0 for all uz € [0,1] and C(0, u2) =0+0 = 0 for all u2 € [0,1], so C 
is grounded. 
2) C(uj,1) = ui +0 = ui for all u, € [0,1] and C(1, u2) = 04+ u2 = us for all u» € [0,1], 
so C has uniform margins. 


3) To show that C is 2-increasing, it suffices to show that C has a density c(u1, u2), 
u1,ua € (0, 1). Clearly, C is differentiable with 


9? 
c(u1, u2) = Dum C Us ua) —140(1—2uj)(1—2u3), uj,u» € (0,1). 
Since 0 € [-1,1], 1 — 2u1 € (-1,1) and 1 — 2u» € (-1, 1), we have that c(ui,u2) > 0 
for all u1, ug € (0, 1) and 0 € [-1, 1]. Therefore, c is a proper density and thus C is a 
proper distribution function. 
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Note. An alternative solution is the following. Let a = (a1, a2) € [0, 1]? and b = (b1, b2) € 
(0, 1]? with a € b1, az € b2. The C-volume A(a jC of (a, b] = (a1, bi] x (az, b] is 
A(a,jC = C(b1, b2) — C(a1, b2) — C(b1, a2) + C(a1, a2) 
=  bib34 A(1 5i)(1 b5)6405 a,b (1 a1)(1 b5)a405 
biaz 0(1 bi)(1 à2)01a2 + a1a2 + (1 — a1)(1 = az)a1a2 
= (b1b2 — q109 — byag + 0102) + e( (1 = bz)b2((1 = by) by = (1 = aı)aı) 
= (1 = a2)ao((1 = by)by = (1 = a1)a1)) 
Note that bib» aıb2 biao + aia = (bi — a1) (be I az) and (1 = by )by —= (1 = d1)Q4 = 
(by — a1)(1 — a4 — 51). Therefore, 
Away? = (b us a1)(bo = a2) + ((bi = a1)(1 — Qi — 5i)((1 E b2)b2 = (1 = a2)a2)) 
= (b x a1)(bo — a2) + A((b1 — a1)(1 — 01 — by) (b2 — az)(1 — 9 — by)) 
= (b — a1)(bo — az)(1 + 0(1 ay b)(1 a2 bz)) > 0, 


and so C is 2-increasing. 
b) Since C(ui, u2) = u1u3 + 0(1 — u1)(1 — u2)u1uo = uou; + O(1 — u3)(1— u1)uzui = C(us, u1), 
u1, U2 € [0,1], C is exchangeable. 
c) The survival copula C of C, see MFE (2015, equation (7.16)), is given by 


ES 


C (u1, u2) =] (1 ui) (1 ua) + C(1 —uj,1— ua) 
—1 + ui + u2 + (1 — ui)(1 = u2) + buruo(l — ui)(1 — ua) 
= uius --08ujuo(1— ui)(1l— u2) = C(uj,uo), u,u» € [0,1]. 


d) By MFE (2015, Corollary 7.35), Spearman's rho of (U1, U2) ~ C is given by 
1 pl 
ps (Ui, U2) = 12 f ri (C(u, ua) = uiua) dui; dus 
1 pl 
= 120 f n (1 = ui)(1 = uz)u1u2 dui du» 
o Jo 
1 2 
= 120( f (1— u)u du) = 120(1/2 — 1/3)? = 0/8. 
0 
The range of attainable ps(U1, U2) is thus |-1/3,1/3], so Farlie-Gumbel-Morgenstern 


copulas are only adequate models for data with a limited range of pg. 
e) We have that 


Due to radial symmetry of C, Ay = Ai = 0; see MFE (2015, equation (7.35)). 
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Solution 7.20 (Kendall’s tau as a correlation) Ex. p. 55 


E(I(X1 < Xi, X2 < X5))- i- 
20-30 -3) 
g(I(X1 € X{M(X2 < X5)) - E(I(Xı < X)))E(I(Xo < X2)) 
var (I(X1 < X1) var(I(Xo < X2) 

p(I(X1 € Xi), I(X2 € X2)). 


pr(X1, X2) = 4PQG € Xi, X2 < Xj) -1- 


Kendall’s tau thus equals the correlation coefficient of the indicators [(X1 € X1) and I(X2 < 
X3). 

Note. If F3, F5 denote the distribution functions of X1, X», respectively, and if F1, F5 are contin- 
uous, then we also have that p+(X1, X2) = p(I(Ui < U1), I(U2 € U2)), where (U1, U2), (U1, US) 
are independent random vectors following the copula C of (X1, X2). 


Solution 7.21 (Blomqvist’s beta) Ex. p. 56 
a) Since P((X1 — mi)(X» — m3) < 0) —-1- P((X1 — mi)(X» — m3) > 0), we have 


pg(Xi, X2) = 2P((X1 — mi) (Xo — m3) > 0) - 1 

= 2(P(Xi > mi, X2 > m2) + P(X1 < m, X2 € me) ) —1 

= 2(P(U; > 1/2, U? > 1/2) + P(U1 € 1/2, U2 < 1/2)) — 

= 2(1 — 1/2 — 1/2 4- P(U, € 1/2, U2 € 1/2) + P(U; € 1/2, U> € 1/2)) — 
= 4P(U, € 1/2, Us € 1/2) — 1 = 4C (1/2, 1/2) — 1, 


where (U1, U2) ~ C. 
b) By a), 


C(1/2,1/2) - 1/4 C(1/2, 1/2) - TI(1/2,1/2) 
eae eed 1/4 M(1/2, 1/2) — 11(1/2,1/2) 


Note. Another expression for Blomqvist’s beta is 


CIS 1/2) T7212) + (2, 12) — 1101/2, 1/2) 
M(1/2,1/2) — I1(1/2, 1/2) + M(1/2,1/2) — 11(1/2, 1/2)' 


pg (Xi, X2) = 


where C denotes the survival copula of C, given by C(ur, ug) = —14ui-uscC(l—u;,1—u3), 
u1, U2 E (0, 1]. 
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c) By a), 


pg(Xi, X2) = 4P(X1 < m, Xo < m3) - 1 
E(I(X1 € mi, X2 € m3)) — 
86-310 - 3) 
E(I(X1 € mi)I(Xo € m2)) - E(I(X1 € mi))E(I(Xo € ma)) 
V var(IGXs < m1)) var(I1(X» < mz) ) 
= p(I(X1 € mı), I(Xo € m3)). 


l. 
2 2 


Blomqvist's beta thus equals the correlation coefficient of the indicators J(.X4 < mı) and 
I(X2 < mg). 

Note. If F1, F> are continuous, we also have that pg(X1, X2) = p(I(U1 < 1/2), I(U5 < 1/2)), 
where (U1, U2) follows the copula C of (X1, X2). 


Ex. p. 56 Solution 7.22 (Convex combinations of copulas) 


a) We show that C is grounded, has U(0,1) univariate margins and is d-increasing. 
1) Let uj = 0 for at least one j € {1,...,d}. Then Ci(u) = 0 and C»(u) = 0 since Cı and 
Cz are grounded. Therefore, 


C(u) = yCi(u) + (1 — 7)C2(u) = y- 0 (1-7) -0=0, 
so C is grounded. 


2) Let u = (1,...,1,u5,1,..., 1) for any 7 € {1,...,d}, u; € [0,1]. Then Ci(u) = u; and 
C»(u) = uj since C? and C2 have U(0,1) univariate margins. Therefore, 


C(u) = yC1(u) + (1 — $)Cx(u) = quj + (1 — y)uj = uj, 
so C has U(0,1) univariate margins. 


3) Let a,b € [0,1]? with a < b. Then A(agjCi = 0 and A(q,5C5 7 0 since C1 and C» are 
d-increasing. Therefore, 


Alab] = Aca (¥C1 + (1 — y)C2) = A (a Ci + (0 — Y) A (a C2 = 9, 


so C is d-increasing. 
b) An algorithm for simulating U ~ C is given as follows: 
1) Generate U ~ U(0, 1). 
2) If U € y, return U ~ Ci, otherwise return U5 ~ C5. 
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e the Law of Total Probability which 


P(U <u) =P(U <u,U <y)+P(U <u,U > v) 
—P(U; <u,U <7) +P(U2 < u, U > 7) 
= P(Ui < u) P(U <7) +P(U2 < u) PU > 7) 
= Ci(u) y + C2(u) (1 — y) = C(u). 
By MFE (2015, Corollary 7.35), Spearman's rho equals pg = 12E(C(U;1, U3)) — 3 for 


ind. 


Ui, U> ^ U(0, 1). Since 


E(C(U1, U2)) 


E(yC1 (U1, U2) + (1 
E(C1(U1, U2)) 


2 


and y and 1—7y sum to 1, any linear transformation of 
of the linear transformations of E(C1 (U1, U2)) and 


E(C (U1, U2)) — 3 = y(12E(C1 (U1, U2)) 
= yps,ı + (1 — 7) ps,2- 


ps = 12 


+(1 


— y)C2(U1, U2)) 
— y)E(C2(U1, U2)) 


E(C (U1, U2)) is the convex combination 
E(C2(U1, U2)), so 


—3)+(1-7)(12 


E(C2(U1, U2)) — 3) 


With the notation as in b) and d = 2, the Law of Total Probability implies that 


E(C(U)) = yE(C1(U)) + (1 — y)E(C(U)) 
= 7(yE(Ci(U1)) + (1 — y)E(C1(U2))) 
= yE(Ci(U1)) + 4(1 — y)E(C1(U2)) + 
E(C1(U5)) = E(C5(U1)), we have that 

R(C(U)) = PEC (UD) +291 — 9) 


Since 4°, 2y(1 — y) and (1— 
convex combination of the linear transformations of 
By MFE (2015, Proposition 7.32), we thus have that 


pr = 4E(C(U)) - 1 


(dorm 


As 


E(C1( 


Tides 
£(C»(U1)) 


(C3(U»))) 


E(C2(U1)) + (1 — 7)E 
E(C2(U2)). 


Sa 


U2)) + (1 — 7)?E(C2(U2)). 


y)? sum to 1, any linear transformation of E(C(U)) is the 


E(C1(U1)); 


E(C1 (U2)) and E(C9(U»)). 


= 3 (4E(C (01) — 1) + 251 — 7) (4E(C (03) — 1) + (1 — 3) (4E(C2(U2)) — 1) 
= bra + 2*1 — 3) (AE(C1(U2)) — 1) + 1 — 9)? pez 

as claimed. 

Note. 

1) As an example, if Cı = M and C5 = W, then E(C1(U5)) = dr min{u,1 — u)du = 
2 fo’? udu = 1/4, so that pr = pra + (1.— 3? pra =P + (1-9)? (-1) = 2y- 1— ps. 
And if C? = M and Co = II, then E(C1(U5)) = E(C9(U1)) = Jm u? du — 1/3, so that 
pr = Y! py +2901 — 9)/3 + (1 — Y? oco = 3? + 24(1 — 7)/3 = vy + 2)/3; note, in 


particular, that the latter is not the corresponding 
of M and II. 


convex combination of Kendall’s taus 
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2) The hint holds by conditioning (and continuity): 


E(C4(U»)) — T 1p Ci (uw) dC2(w) = P(U, x U2) = P(U2 < U;) = 0 T C5(u) dC; (w) 


= E(C2(U1)). 


Ex. p. 57 Solution 7.23 (Bivariate Marshall-Olkin copulas) 
a) The survival function F of (X1, X2) is given by 


F(z1,22) = P(X, > 21, Xo > 23) = P(T, > z1, T3 > 21, To > £2, T3 > £2) 
= P(T; > z1, T> > z2, T3 > max(zi,2]) 
= P(T; > zi)P(T5 > z3)P(T3 > max(zi, x2}) 
= exp(—A 121) exp(—A223) exp(—A3 max(z1, r2}) 
= exp(—A 21) exp(—A2%2) exp(min{—A321, —A3%2}), 11, £2 € [0, 00). 


We obtain that F(x) = exp(—(A; + As)z) with corresponding generalized inverse F $(u- 
— log(u)/(Aj + Aa), j € {1,2}. With a; = A3/(Aj + A3) and thus A;/(A; +A3) 21— aj, j € 
{1,2}, it follows by Sklar's Theorem for survival functions, see MFE (2015, equation (7.14)), 
that, for all a1, a2 € [0,1], 


C(ui, uy) = FFP (ui), F$ (u)) 
= exp((1 — aı) log u1) exp((1 — a2) log u2) exp(min(o4 log u1, a2 log u2}) 
ug},  ui,uo € [0,1]. 


l-a, 


a2 
mon 


= ul “ug minus, ug} = min(ujul^ 


b) Let Vi, Vo, V3 ^7 U(0, 1) and note that, in distribution and for each j € {1,2}, 
T; = —log(Vj)/Aj ~ Exp(Aj) and T3 = — log(V3)/A3 ~ Exp(As), 
so that 


F(X) = Fy(min{Tj, T5)) = exp(—(Aj + As) min{— log(V;) Aj, — log(V3)/As}) 
= exp(max{ (log V;)/(1 — o), (log V3)/a;}) = max (V; / 079). vile \, 


A stochastic representation of C is given by (U1, U2) ~ C, where 
(Ui, Ua) = (Fi Q3), FQ) = (max (v; / 799, vil}, max [yz/0799, vafe) 
for 04,0» € [0,1], with the convention that, for any v € (0,1), v!/9 = 0. 


c) The coefficient of lower tail dependence is given by 


C : 2-a2 ,,2—01 
à = lim Cee) = lim mmm FM uM lim min(u 
u—0-- 


PG: yr } 
u04- u u 70+ u 


= To 
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and the coefficient of upper tail dependence is given by 


1 — 2u + C(u, u) 1 — 2u + min{u?-%, 42-91 


Au = lim = lim 
u>1l— l-u u>1l— l-wu 
1—2u+ y2-min{ar v2} NEM (2 " min{ay ag} )ul-min(o1,02] 
u>1— l-u u1— —1 


= 2 — (2 — min(o1,02]) = min{aj, ae}. 


Solution 7.24 (Sampling copulas and meta-distributions) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 7.25 (Fitting copulas to equity return data) 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 
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Solution 7.26 (The conditional copula of a Clayton copula) 
Let fi5:3(u1, u | us) denote the density of the conditional distribution function F15,3 of (U1, U2) 
given U3 = ua. Then 


(Y=) (us) 


1 


Pisis (u1, U2 | us) = = 


f3(us) 1 


fiz3(u1, U2, ua) c(u1, U2, U3) -»e( 2 97) [ 


1 j 


j 
and so 
3 


Fiaa (u1, u2 | ua) = T is Fizi3(v1, v2 | us) dva dv; = JT 971) (i7) (a. 


j=l 


The margins of F53(u1, u2 | u3) are 
Fyj3(ua | us) = Figg(ur, 1| us) = VI Qo (u) +o (u3) (Y7) (us), 
Foi (ua | us) = Figj3(1, u2 | us) = AQ (u2) +o (u3) (Y7) (ua) 
with corresponding quantile functions 
Fýglu | us) = (Q^) (Y (Y (ua))ui) — ^ (ua)), 
F3s(u» | us) = v(QU^) (Y (Y (ua))u2) — v^ (ua)). 
By Sklar's Theorem, the copula C123 of F15j3 is thus given by 


Cia (u1, U2 | ua) = Fia (Ffa (u | us), F3(uo | ua) | us), u1, U2 € (0, 1]. 


Since 1 (Fiz (uj | us)) = (0) 1 (W (971 (us))uj) — v7! (us), j € {1,2}, 


Csoj (1 , ua | us) = W (Qu) Qu (Y Qus) )ui) + (WE Qu Q1 (ug))u2) — v7 (us) ) (Y7) (us). 
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Ex. p. 57 


Ex. p. 58 


7 Solutions to Copulas and Dependence 


With p(t) = (1+ t) V8, 4 1(t) = t? -1, w) = —(1/9)(1 4 t) U^ and (và) !(t) = 
(—0t) -9/(*9) _ 1, we obtain that 


(U^) 1 QU Q7 (us))uj) = (-6(-Q/9)0 + (ug? — 1) Pug) OD —1 


= (-6e(-(/0)uj*?)u;) 9/6*0 — 1 = (ug Puy) OY — 1 


= u3 zu MP 1 


and thus, since (i-1)'(t) = —0t-?-!, that 


Cyiajs(u1, uo | us) = (us uy OY —1 + usu; OY — 1 — (uz? — 1)) (971) (us) 
= wl (uz? (uy CTP + uz” OHD — 1) — p (-8juz71 

1/0)(uz *(u; O + uz O — 171/9710 gy 071 
AETA 0/(0+1) + uz OD 7 jee 


(= 
=u 
-6/041 | yzê/(@+D _ 1)-(641)/6 


( 
(uy eee Ug 


6/69) _ 4)-1/(6/(6+1)), 


This is a Clayton copula with generator Y(t) = (1+ t) -/ 0/(-)). so parameter 6/(0 + 1). 


Ex. p. 58 Solution 7.27 (A flexible bivariate copula construction) 
a) 1) Since g1(0) = g2(0) = 0, C(0, u2) = 0+ g1(0)g2(u2) = 0 and C(u1,0) = 0+ g1(u1)g2(0) = 
0, so C is grounded. 
2) Since g(1) = go(1) = 0, C(ui,1) = ui + g(u)gY(1) = ui and C(1,u2) = u2 + 
gi(1)g2(u2) = us, so C has U(0, 1) univariate margins. 
3) By the assumptions on gi, g2, the function C admits continuous partial derivatives with 
respect to both variables and, by the assumption that g}(u1)g5(u2) > —1, satisfies 


o? 


D ON —~— C(u, u2) 2-1--gi(ui)g9(u23) > 0, u1, ua € (0,1). 


c(u1, u2) = 


Hence c is a density and thus C is 2-increasing (as a proper distribution function). 
b) By Hóffding's Lemma and since the margins of (U1, U2) are U(0, 1), 


cov(Xi, X3) gr Heel 
= , V d d 
~ var Xi var Xa — (u1, u2) — u1u2) dui dua 


1 
= nf [ gi(ur)g2(u2) dui du» = ef gilu) du f ga(u2) du». 
c) If gı is point symmetric about 1/2, then Ie g1(u1) dui = 0 and thus p = 0; similarly for go. 


d) Consider g1(u1) = u1(1— u1)(1 —2u1) € [—1,1], wi € [0,1], which is point symmetric about 
1/2 as the product of uj(1 — u1) (symmetric with respect to uj = 1/2) and 1 — 2u; (point 
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symmetric about u; = 1/2). Note also that gi (u1) = 6u? — 6u; +1 which can be checked 
to be in [-1/2, 1]. Furthermore, consider g»(u2) = 0u»(1 — u2) € [-1,1] for all 0 € [-1, 1] 
and note that g5(u3) = 0(1— 2u2) € [-1,1]. Hence, g1(u1)g5(u2) > —1 for all 0 € [—1, 1]. 
Therefore, for all 0 € [—1, 1], g1, g2 fulfill the assumptions for C to be a copula by a) and 
p — 0 by c) for all 0 € |-1, 1] since gı is point symmetric about u1 = 1/2. 

e) Any gi, go symmetric about 1/2 lead to C being radially symmetric since 


Ĉ(u1, ug) = —]-cu;-us-cC(l-uj1- u2) 
= —1 +u, + uo + ((1 — u1)(1 — u2) + gı(1 — u1)g2(1 — u2)) 
= —1 + u + ug + (1 — u — ua + uru + gi(u1)go(u)) 
= uiu» + gı (u1)g2 (u2) = C (u1, u2), u1, u2 € [0,1]. 


f) If g1 Z go, then C cannot be exchangeable. As a concrete example, take gı (u) = sin(ru)/r 
and go(u) = u(1 — u) (which satisfy all assumptions). 


Solution 7.28 (The Liebscher construction) Ex. p. 58 
a) Let (X1, X2) ~ F. Then, using independence of (U1, Vi) and (U2, V2), 


F(x1, £2) = P(X1 < z1, X2 < x2) 

= ue <t pe UE E p eus mn < 2x9) 
(Ui 
( 


=P < af, U: 55a) © Vi <2h,Ve < 23° a 
U, < a$, V E € z] ^, Vo € a5 ^) 


= Cx(z$, 28)Co(21-*, 25°), 21, £2 € [0, 1]. 


| 
d 


Since the margins Fi, Fy of F are Fi(z1) = F(z1,1) = Ci(aQ, 1)Co(a1 ^,1) = aul = 21, 

x, € [0,1], and, similarly, Fə(x2) = F(1,z2) = x2, z2 € [0,1], the copula C of F equals 

C(u1, uz) = F(Fj (ui), Fy | (u)) = F(ui, ua), ur, u2 € [0, 1, as stated. 

b) i) For (a, 8) — (0,0), we obtain C(u1,u2) = C1(1,1)C2(u1, u2) = C2(u1, u2), u1, U2 € 

[0, 1); 

ii) For (o, 8) > (1,1), we obtain C(u1, u2) = C1(ui,u3)C9(1,1) = Cı(u1, u2), u1, U2 € 
(0, 1]; 

iii) For (a, 8) > (1,0), we obtain C(u;, u2) = Ci(ur, 1)C2(1, u2) = uiuos, u1, u2 € [0,1] (the 
independence copula); and 

iv) For (a, 8) > (0,1), we obtain C(u1, u2) = C1(1, u2)C9(u1, 1) = uiua, u1, u2 € [0,1] (the 
independence copula). 


c) We have 
A = ji C(u, u) 1 Ci(u ,U Jesu ae) -— Keg Pur) 
u04- u u04- u u—>0-+ ucula 
Oo ye C —a l-a 
= 2 lim 2 1 Aa = ÀyAi2. 
u—0+ ue u04- u —a $ , 
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Ex. p. 59 Solution 7.29 (The Rosenblatt transformation and the conditional sampling method) 
a) By (E1), for u; € [0,1] and u1,...,uj—1 € (0,1), 

gii j - 

E 1- Hou VOZ 1 dd k 

52. 1: E e 3 


E m 


Cjn,....j-1(u5 | u1;..., uj-1) = 


and hence | 
Fig alte sss ea) = (ED) e (yor (Zoa 2) uj) -Xew). 


Note. (S1) is indeed in [0,1]. Being non-negative follows from the fact that the numerator 
and denominator have the same (alternating) sign. Being bounded by 1 can be seen by 
considering two cases: If j — 1 is even, then /U-U is non-negative and decreasing (since 
vU) < 0), so omitting the additional term i 1(uj) > 0 in the numerator provides an upper 
bound to (S1), which is one. And if j — 1 is odd, then v U— is non-positive and increasing 
(since Y?) > 0), so omitting the additional term i 1(u;) > 0 in the numerator also provides 
the upper bound 1 to (S1). 


b) y(t) = (12-t)-V?, t > 0, 0 » 0, us inverse ^ T ) = u7 —1, u € [0,1]. By Solution 7.17 a), 
w)(t) 2 (-1)* a + t)- 9-5 TIEAL + 1/0), t € (0,00), k € No, with the empty product 
being defined as 1. From (S1), we his obtain that 


1+5 u7? — TG) 
k=1\"%k 


C; ; 1005 | 04,..., uj 1) = 
jl... 1 (Uy | ua j-1) (+D 1 (uz? NA 
Ed DO xia ru’ 
and hence 
Cj, i-i (Uy | U1, +. +, Uj—1) 


bcp joi j-1 —1/8 
T Grae) 


i (jo Sa) wu) ((a Ga) Y) = )) 


= (1+ (1-( j— 1) DT \(u; ~1/(j-141/8) = 
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Solution 7.30 (Archimedean Marshall—Olkin copulas) Ex. p. 60 
Let zr; V 2 = max{x1, 22} and z1 A x2 = min{z1, 272}. Assume that vo(t) = E(exp(—Vot)), 
€ [0, oo). Conditioning on Vo ~ Fo and utilizing the stochastic representation (W1, W2, W3) = 


ind. 


(wo(E1/Vo), V'o(E2/ Vo) , Wo(E3/Vo)) for E1, E2, E3 ^7 Exp(1), the joint survival function F of 
(X1, X2) is given by 


F(a, 22) 
= P(W, € e-*!, Wa € e*?, Ws < e122) 


f P(uo(E1/vo) X €-?*, vo(E»/vo) < e€-*?, do(Es/vo) < e- 91 V*2) dFo(vo) 


p i P(Ei > vovg (e 7), E» > vovg (e 77), Es > vous (e 0*9) dFo(vo) 


| * exp(—v0 (Wp ! (7?) + ig Mem) + ui! (e 01 22))) AF (vo) 
= yoly 1 (6791) + i (e) + ig Mem) 


= = Cole only eg ?2,g (e1Ve2)), £1, T2 € [0, 00). 


The corresponding marginal survival functions are F;(z;) = vo(wg (e?) + pg 1(e-?7)) = 
Vo(2ug (e-7:)), aj > 0, (obtained by letting zy = 0 for k 4 j), j € {1,2}, with inverses 


Ff (uj) = — log bo(bo (uj)/2) = — log Go(uj), uj € [0,1], j € {1,2}. 


Therefore, the survival copula C of (X1, X2) is given by 


C(ux, ua) = FRE (ui), Ff (uz)) = Co(Go(ui), Goltz), c (Ps GoM log Go(u2)))) 


Solution 7.31 (Axioms of concordance) Ex. p. 60 

Let Fx, Fy denote the distribution functions of X, Y , respectively. Let C, Cn denote the copulas 

of the bivariate distribution functions F, Fn, n € N, and let U ~ C, Un = (Uni, Un2) ~ Cn, 

n € N. With this notation, we obtain: 

a) i) (Domain) Since ps(X,Y) = p(Fx(X),Fy(Y)) = p(Ui,U2) which exists for every 
distribution function of (U1, U2) (that is for every copula C) since var(U1) = var(U2) = 
1/12 « oo. 

ii) (Range) Clearly, p € [-1, 1] and thus ps € [71,1]. Furthermore, for U ~ U(0, 1), 


ps(W) = p(U,1 — U) = 12(E(U(1 — U)) - E(U)E(1 — U)) = 12/6 — 3 = —1 


and 


gs( M) = g(U,U) = 12E(U?) -3—4—3— 1. 
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iii) (Symmetry) 


ps CX, Y) = p(U;, U2) = 12( E(U1U2) = E(U;) E(U2)) = 12(E(U3U1) = E(U2) E(U1)) 
= p(U2, U1) = ps(Y, X). 


iv) (Independence) If X and Y are independent, then C = II and thus pg(X,Y) = 
£(U1, U2) = pI) = 0. 
v) (Change of sign) Note that —X ~ 1 — Fx(—z), so 


ps(- X, Y) = p(1— Fx(-(-X)), Fv(Y)) = o(1 — Ui, U2) 


= 12(E((1 — U1)U3) — E(1 — U1 )E(U2)) = 12(1/2 — E(U1 U2) — 1/4) 
= —12E(U1U2) + 3 = —(12E(U1U5) — 3) = —p(Ui, U2) = —ps(X, Y). 


vi) (Coherence) By MFE (2015, Corollary 7.35), 


1 pl 1 pl 
ps(C1) = 12 f f C4 (ur, u2) dui duo — 38 < 12 f n C2(u1, u2) dui dug —3= ps(C2). 


vii) (Continuity) By the Portmanteau Theorem, lim; ,5; E(Un1Un2) = E(U1U2) and thus, 
by MFE (2015, Corollary 7.35), 


im ps(Fn) = Jim g(U5,) = 12 Jim E(UniUn2) — 3 = 12E(U1 U2) — 3 = p(U1, U2) 
= ps(F). 


b) Consider (X,Y) ~ U((z € R? : ||| = 1}), that is, (X, Y) is uniformly distributed on the 
unit circle. A stochastic representation is (X, Y) = (Zi, Z2)/||(Z1, Z2)|| for Zi, Za ^^ N(0, 1); 
see MFE (2015, Corollary 6.22). Note that (—Z1, Z2) is distributed as (Z1, Z2) since 


IP(—Z1 € t1, Za € z2) = P(Z1 > —21, Z2 € £2) = P(Za € z3) - P(Z1 € —21, Z2 € v3) 
$(z5) — $(—21)6(z2) = (1 — $(—21))9(22) = $(z1)9(z2) 


— P(Z < 41,29 < 22). 


Therefore, the change of sign axiom implies that &(X, Y) = &(—X,Y) = —&(X,Y). We 
thus obtain that &(X, Y) = 0 although X and Y are not independent (since X? + Y? = 1). 


Note. For checking the axioms for Kendall's tau, see, for example, Hofert (2010, Theorem 1.7.8 
and thereafter). 
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Review 


Solution 8.1 (Acceptance sets) 


a) The acceptance set of a monotone and translation-invariant risk measure o is Ay = {L € 
M : p(L) < 0}, where M denotes a linear subspace of the set of all almost surely finite 
random variables. 

b) By MFE (2015, Proposition 8.6), o is convex if and only if A, is convex, and o is coherent 
if and only if A, is a convex cone. 

c) A; — (L € M : VaRG(L) € 0) 2 (L € M:TF(L € 0) > a}, so A, consists of all losses of 
M for which the distribution function is at least a at 0. 


Solution 8.2 (Dual representation of ES) 


a) By MFE (2015, Theorem 8.14), the dual representation of ES,(Z), a € [0,1), for L € 
£A (Q,, F, P), is given by 


ES4(L) = max(E9(L) : Q € Qa}, 


where Qa is the set of all probability measures on (Q, F) which are absolutely continuous 
with respect to P and for which the measure-theoretic density dQ/dP is bounded by 
(1— o)-1. 

b) Let 0 X o « f « 1. Since densities bounded by (1 — a)! are also bounded by (1— 8)! > 
(1— a Qa C Og. Since the maximum over the (larger) set Qg is at least as large as the 
maximum over the (smaller) set Qa, we obtain that ES4(L) = max(E9(L) : Q € Qa} < 
max{E®(L) : Q € Og} = ES,(L). 

c) If Li, L2 € M then 


ESa(Lı + L3) = max(EV(L, + L3) : Q € Qa} = max(E9(I4) + E2(L2) : Q € Qa} 
< max{E®(L1) : Q € Qa} + max(E(14) : Q € Qa} = ES4(L4) + ESa(L2), 


where the inequality holds since the two maxima on the right-hand side of the inequality 
can be obtained for different elements of Qa. 
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Solution 8.3 (Distortion) 


a) The distortion risk measure o(L) associated with a convex distortion function D (convex, 
increasing, absolutely continuous on [0,1] with D(0) = 0 and D(1) = 1) is defined by 


ot) = | FEO ADE) = [ rir ow) du, 


where the increasing function ¢ is the density of D (the so-called spectrum); see MFE (2015, 
Definition 8.16). 


b) It holds that 


a(t) =f ESUE) gto) ($1) 


where u is a probability measure on [0,1] related to the spectrum ¢; see MFE (2015, 
Proposition (8.18)). 

c) Comonotone additivity and coherence follow immediately from the integral representa- 
tion (S1) and the comonotone additivity and coherence of expected shortfall. 


Solution 8.4 (Subadditivity of VaR for jointly normal risks) 

For j € {1,2}, let L; = AX € M denote two losses from M. As a linear combination, L; ~ 
N(ur;, 01.) where jp; = Aj and o; = A;jXA;. Furthermore, L4-- Lo = (A14- A2) X € M and 
Lid La ~ N(up, 415,07, 415): Since corr(L1, Lz) € 1 and thus cov(L1, L2) € yvar(L1) var(La), 


OL, +L = var(Lı + L2) = var( L1) + var( L2) + 2cov(L4, Lə) 


< var(L4) + var(L2) + 24/var(L4) var(L2) = (ap, + 01)”, 
SO OL, 4L, € OL, + OL. Since, for a € [1/2, 1), 6-'(a) > 0, we obtain that 


VaRo(Li + L2) = Br e,  01441,9  (0) = up, + Hrs + 91,4 1,9 (a) 
X pp, + up, + (01, + 01,)8 (a) = ur, + 01,8 (o) + up, + 0r, (o) 
= VaR4(L1) + VaRa(L2), 


so VaR, is subadditive for all a € [1/2, 1). 
Note. For a € (0,1/2), ®-!(a) < 0 and so the inequality holds in the other direction; in 


particular, VaR is superadditive then. See also Exercise 2.22 b) for a special case of this 
exercise. 


Solution 8.5 (Maximal ES of the sum) 

Correlation is maximal if Lj and L are comonotone; see MFE (2015, Theorem 7.28) or 
Exercise 7.6 a). Comonotone additivity of VaRa immediately implies that ES, is comontone 
additive, so ES4(L4 + L2) = ES4(L1) + ESa(L2) if the correlation between Lı and Lə is 
maximal. Since ES, is coherent and thus subadditive, we in general have ES,(L, + L2) € 
ES4(L1) + ESS (L3) so that the upper bound is indeed attained (and thus ES, maximized) if 
Lı and Lə have maximal correlation. 
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Note. See Exercise 7.6 c) for a similar exercise but for value-at-risk. 


Solution 8.6 (Revisiting Markowitz) 
Let X ~ Eal, £, Y). By McNeil, Frey, et al. (2015, Theorem 8.28 (1)), L(A) = A'X has 
VaRa(L(A)) = A'u + VA'XA VaRo(Y) for Y ~ S4(v) (univariate spherical with generator 


p). 


By the condition E(L(A)) = A'u = c and since A'ZA = var(L(A)), we obtain that 


VaRa(L(A)) = c + V/var(L(A)) VaRo(Y); note that a > 0.5 implies that VaR4(Y) > 0. This 


implies that value-at-risk and standard deviation are proportional on M and the result follows. 


Solution 8.7 (On correlation adjusted summation in risk aggregation) 


a) 


g 


c) 


A strength of the simple summation formula is simplicity; there is no need to specify the 
‘correlation’ parameters p;j. On the other hand dependencies and diversification effects 
cannot be accounted for which is possible in the correlation adjusted summation formula 
(albeit in an ad-hoc manner). Note that the simple summation formula is a special case of 
the correlation adjusted summation formula (set pi; = 1 for all i,7 to see that). 

It should be noted that outside the elliptical world both formulas are rules-based and not 
principles based. 


Let EC = ES, (ZL) and EC; = ESa (Lj), j € (1,..., d). Then, the simple summation formula 
is conservative since subadditivity of ES, implies that 


d d d 
EC = ESa (L) = ESa( X £3) < X` ESa(L;) = S EC, 
j=l j=l j=l 


so taking EC to be Ns EC; is always conservative in this case. However, if VaRq is 
used instead, then VaR can be superadditive in which case the inequality would be in the 
other direction and so the simple summation formula would not be conservative (rather 
underestimating the required capital). 


No. For v1 Æ o» the lognormal losses Lı and Lə are not equal in type, so that the maximal 
attainable correlation is strictly smaller than one; see MFE (2015, Example 7.29). 


Solution 8.8 (Economic properties of a capital allocation principle) 


a) 


If one uses a risk adjusted performance measure of the form RORAC; — E(—L;)/ AC;, that 
is the ratio of the expected return of business line j and the allocated economic capital, 
one needs to determine AC;. To this end, capital allocation principles are used to take into 
account the relation of L; and L in an appropriate way. 


The Euler capital allocation principle associated with o has risk contributions 
Ór 
AC? = STE. Taped, 
j75 A ) I<j< 


where r(A) = p( Ba Ass. RORAC compatibility implies that the Euler principle 
gives correct signals for RORAC-based performance measurement and the existence of a 
diversification benefit means that the Euler allocation rewards diversification if based on 
some coherent risk measure; see MFE (2015, Section 8.5.3) for details. 


221 


Ex. p. 61 


Ex. p. 62 


Ex. p. 62 


Ex. p. 63 


Ex. p. 63 


Ex. p. 63 


8 Solutions to Aggregate Risk 


Basic 


Solution 8.9 (Acceptance sets for generalized scenarios) 

First note that the generalized scenario risk measure o is monotone and translation invariant. 
The acceptance set is given by A; = {L € M : o(L) € 0) where M denotes the set of all 
random variables. Therefore, Ag = {L € M : EO:(L) < 0, E92(L) < 0). Since E@(L) = 
L(w 1) Qi {w} + L(w2)Q1((w2)) = 0.61; + (1 — 0.6) = 0.61, + 0.412 and, similarly, E22(L) = 
L(w1)Q2(1«1]) + L(w2)Q2(1w2)) = 0.31, + 0.712, A, can be identified with 


{l € R? : 0.61; -- 0.415 < 0, 0.31, -0.7l < 0} = {l € RÊ? : la < —(3/2), l2 € —(3/7)] 
= {LE R°: ly € min{—(3/2)h, —(3/7)}}, 


which describes a cone in R?: lz < —(3/7)l; if lı < 0 and Ig < —(3/2)l; if lj > 0. 


Solution 8.10 (Convexity of coherent risk measures) 


a) Let o be a coherent risk measure. Then, by subadditivity and positive-homogeneity, 
o(ALy + (1 — A)L2) < o(AL1) + o((1 — A)L2) = Ao(L1) + (d — A)e(£2) 


for all A € [0,1]. Therefore, o is convex. 

b) Let o be a positive-homogeneous convex risk measure. By definition, o is monotone, 
translation invariant and positive-homogeneous. We have thus left to show that go is 
subadditive. To this end, let Li, Ls be two risks. By positive homogeneity and convexity, 


e(Li + L2) = o(2(11/2  L2/2)) = 2e(L1/2 + L2/2) € 2(0(11)/2 + 0(12)/2) 
= o(L1) + (L2) 


which proves subadditivity. 


c) Risk measures based on strictly convex loss functions are convex but not coherent. For 
another example, start from an arbitrary coherent risk measure o and define o(L) = o(L)+1; 
0 is convex but not positive-homogeneous and thus not coherent. 


Solution 8.11 (Standard deviation principle as a risk measure) 

'Iranslation invariance and positive-homogeneity are straightforward to check. For subadditivity, 
note that for two random variables Li, L2 with finite second moment it holds that var(L1-- L3) = 
var(L1) + var(L2) + 2cov(Li, L2) and cov(Li, L3) = corr(Li, Lz) sá(L1) sd(L2). Thus 


sd(Lı + L3) = yvar(Lı + L2) = /var(L1) + var(L3) + 2 corr( L1, L3) sd(L1) sd(L2) 


/var(L1) + var(La) + 2sd(L1) sd(L2) = y/(sd(L1) + sd(L))? 
sd(L1) + sd(L2) 


IA 


and, since E(L4 + L3) = E(L1) + E(L2), subadditivity of the standard deviation principle 
follows. 
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For a counterexample for monotonicity, consider Li, Lə as in the hint. Clearly, o(L1) = 0 
and L3 < Li. We need to show that o(L2) > o(L1). A straightforward computation gives that 


1 1 1 1 
o(L2) = CD 25 14 kyn 1) 


n n 


which is positive for n > 1 + 1/K?, so that o(L3) > 0 = o(L1) for such n. 


Solution 8.12 (Standardized and strictly convex risk measures) 
Consider some A > 1 and let a = 1/A € (0,1). Since o is strictly convex it holds that 


Aol L) = Ag(o(AL)) = Aela(AL) + (1 — a)0) < Aao( AL) + (1—0)o(0) — @(AL). 


e(0)=0 


This inequality shows that certain strictly convex risk measures punish concentration risk in 
the following sense: if one scales up a position by a factor A > 1 one needs to scale up the risk 
capital by more than A to make the larger position acceptable. If a risk measure is positive 
homogeneous — as any coherent risk measure is — such a punishment of concentration risk is 
not possible. 


Solution 8.13 (Monotonicity of expectiles in the confidence level) 
By MFE (2015, Proposition 8.23), eg(L) = Fy (o) for some distribution function Fg, which is 
increasing (non-decreasing) in the confidence level a. 


Solution 8.14 (Expectiles of the standard uniform distribution) 
a) By definition, e4(L) is the unique solution y of aE((Z — y)*) = (1 — a)E((L — y)~). By 
using that x = xt — x7, so x7 = xt — 2, we obtain that 


aE((L — y)*) = (1 — e(E(L — y)*) — (E(L) — y)). 
Subtracting (1 — a)E((Z — y)*) from both sides, we have 
(2a — 1)E((L — y)*) = (1 — a)y - (1 — o)E(L) 


and thus y = E(L) + 22 K((L — y)*). Using z^ = x + z^, we obtain that E((L — y)*) = 
E(L) — y--IE((L — y)-) and thus y = E(L) + 2221 (E(D) — y--IE((L — y)-)). This is equivalent 


l-a 
to (1+ y = (1+ PEEL) + AQE((L—y)~) and thus y = E(D) + °=*E((L —y)7). 
b) i) Since E((L — y)-) = [T (—(I — y)) dl = y?/2, we obtain e4(L) as a solution y of 
Aspen BRE. 4, 1,2a-1y? 
y = EQ) + P Igqu yy) = 5 + BE 


and thus, by multiplication with a, as a solution of (a — 1/2)y? — ay + a/2 = 0. The 
two candidate solutions of this quadratic equation in y are 


at ya? —-4(a—1/2)2/2 ac va - o 1X yl/oa-1 


VT 2a—1 2a—-1 2-l/a 
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Consider a > 1/2. D/G 2— s a is increasing in a and \/1/a—1 is decreasing in a, 
y+ = (1 V/1/a — 1)/(2 — 1/a) is decreasing in a and thus y+ cannot be an expectile; 
see Tab 8.13. a ius obtain that 


La al (Si) 


ood aD TEES 


for L ~ U(0, 1) and o € (0, 1). 
ii) By MFE (2015, Proposition 8.23), e4(L) = Ff (o) for 


Eo) = yFr(y) — J” s dFr (a) _ yF ry) — fo xdr 
DY) YFL) [8.4 Fi(a)) +E) -y  2yFi(y) — fd cda) +1/2—y 
y? — y*/2 = UP 


Wy? —y?/2)+1/2-y y-1/2-y 


We see that F,(y) = o if and only if (a — 1/2)y? — ay + a/2 = 0 and the solution (S1) 
follows as in i). 
c) For Fr being the distribution function of U(a, 5), we have L = a + (b — a)L for L  U(0, 1) 


with e,(L) as given in (S1). Since expectiles are coherent risk measures (in particular, 
translation invariant and positive-homogenenous), 


— V/l/a—1 


e«(L) = a+ (b — ajes(L) = a + (b — a) 2— 1/a 


Ex. p. 64 Solution 8.15 (Expectiles for exponential and Pareto risks) 
a) By Exercise 8.14 a), e4(L) is the solution y of 


20 d E((L — y)*) =E(L) + a = [6 — y)Aexp(- Al) dl 


y — E(L) + 


l-a 
1 2a-1 1 1 se 

-5+2 (| x y) exp(— an) + = 3! exp(—Al) at) 
1 

B 


m 1 1 2a —1 
4 m a(o 04 à exp( w) = -Gi ar exp( x) 


1 


or ày — 1 = 2a- 1 = exp(—Ay). Equivalently, (Ay — 1) exp(Ay — 1) = a ae With z = Ay — 1, 


we obtain eae jus fae and thus z = W (deze). where W denotes the Lambert W 


function, so eg(L) = y = (1+ Wi 2- a z))/A. 


b) Let L ~ Pa(@,«). We first derive an me for general 0. Note that Fr(r)-—1- (k/(k + 
z))9, x > 0, for 0 > 1 (so E(L) = &/(0 — 1) < oo) and & > 0. The corresponding density is 
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fr(z) = 0k? (r + x) C+). By integration by parts, 


a(E-y)*)= f (fiat = e«t f (1 — y)(« 4 D) (99 al 
y y 
1 oo 
nipote na 
serene] esf een 
= ets [p 0 4- ! (K+)? 4 = ety” —041 
0 1-0 
Exercise 8.14 a) implies that ea(L) is the solution y of 
— EF 2o —1,. EE NE iat Ke ww 
y=E(L) + [—E(L-9)*)= 9 + 125-16 *9 
JEN a ey 
0—1 


andi XL NT 1 


(k +y)! which is equivalent to (y — &)(& 4- y) = 472044 
and thus y? = k?a/(1 — a) = e ia — 1). Since expectiles need to be increasing in a, see 
Exercise 8.13, only the positive solution y = &/4/1/« — 1 provides a valid expectile and 
thus eg(L) = &/4A/1/a — 1. 

Note. As Bellini and Di Bernardino (2017, Proposition 2.3) show, for any Fr(r) = x- V L(z) 
for L € Ro and £ € (0,1), lima. ,— XC = (1/€ — 1)-5. This can be used to approximate 


eal L) for large a € (0,1) and L ~ ~ Pa(f, n in case 0 € (1, œ0)\{2}. 


Solution 8.16 (Stress test risk measure) 
Monotonicity, translation invariance and positive-homogeneity are straightforward to check. 
For subadditivity consider two losses Lı = lı (x1) and Lə = lo(22). Then 


os(Lı + Lo) = sup{ (l + l3)(z) : x € S} € sup{lı (x) : x € S} + sup{lə(x) : x € S} 
= os(Lı) + os(Lə2), 


where the inequality holds since the suprema on the right-hand side can be attained for different 
T: 


Solution 8.17 (Euler principle for multivariate normal risks) 
Since (L1,..., La) ~ Na(0, £) it follows that a AjLj ~ N(0, XZA) which implies that 


resa (A) = ES. (L(X)) = po mee 
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Ex. p. 64 


Ex. p. 64 
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Now & ANEA = 2EA so that p; YA = 2(NA); and thus 


Ó 
9A te (A) 


note that L — L(1). 


_ $(97(a)) 8 j|. $(97(9)12(23); 
xd- Ren aN eem “tee 2 JATA Acl 
_ é(9 !(o) (X1); — ó(9 (a) Dhar cov(Lj, Lu) 

l-a VIl l-a sd(L(1)) 
$(9 '(o))cov(L; L(1) _ ó(9^ (o)) cov(L;, D). 
l-a sd(L(1)) l-a sd(L) ' 


Note. We already know from MFE (2015, Section 8.5.2) that expected shortfall contributions 


take the form EC; — 


E(L;|L > VaRa(L)), so the above is an explicit form of this under the 


assumption (L,..., La) ~ N4(0, X). 
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Solution 8.18 (Dual representation of convex risk measures) 

We follow the steps in MFE (2015, Example 8.12). Let ¢, = L(wi), k € (1,2). Since 
o(L) = log E(e”) = log((e^ + e'2)/2), the set of points where o(L) = 0 is given by (£ = (1,42) : 
log((e + e'2)/2) = 0) = (£: fo(€) = 0) for f,(£) = e + e'? — 2, which equals {(4, £2) : £9 = 
log(2 — e^)). The acceptance set can thus be identified with A, = {(¢1, £3) : £9 < log(2 — e^). 
To construct Qr, consider, for example, 41 = 0 and thus £2 = log(2—e“!) = 0. The normal vector 
u of the supporting hyperplane in £ = (0,0) is u = V fo(€)|e=(0,0) = (e^, e?) |i. (o,0) Eu. 
and thus q = u/||u|| = (1, 1)/ 2 identifies the measure Qz. 


Solution 8.19 (Representation of ES) 


a) By definition of elementary conditional expectation one has 


TVaR4(L) = 


E(LItp.vamo(py) — E(LI trs vans (1))) 


} 
P(L > VaR,(L)) | 1- Fr(VaR4(L)) 


The claim now immediately follows from (E1). 
For the interpretation, note that 


gfe) - Tad) Ten 


0 =1 


l-a l-a ^" 1l-« 


and thus Fr(Ff-(o))/(1— o) € [0,1] and 1 — FL (FẸ (0))/(1 — o) € [0,1]. Hence ESa(L) is 
a convex combination between TVaR4(L) and VaR4(L). 
b) If P(L = Fý (a)) = 0, then Fr(Ff-(o)) = 1 — a and it follows from (E1) that 


ESq(L) = 
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E(Llttsre(a)}) + FE (a)(L-a-(1—a@))  E(L is ri ()) 


l-a l-a 


= 


(a) 
(Lr re (ay) 
l-a j 


Advanced 


E(Llgpzr-(o) = EFE (e)Muzrrz(9j) = FE (@)P(L = 
B(LItL=Ff (a)})/P(L = F$ (a)). It follows from the definition 


a)) and thus Ff (o) = 
} j that 


: P(L = Fý (a)) > 0, then 
Fr ( 
(o 

of I 


1—«-— F,(Fi (a)) 
P(L = Ff (a 
= E(LIrsre(a)}) + (17 a — Fr( 


E(LItpsF-(0)) + E(LItr-F-(o)) 


nee » 
(LL ps pea) = 


) 
Ff (a)))FE 


Therefore, also in this case, ES,(L) = (LD) pets /ü — a). 


Solution 8.20 (CoVaR and systemic risk) 

a) CoVaRa, g(Lə | L1) is the VaRq of the conditional distribution of Lz given L1 > VaRg(L1) 
so given that Lı is stressed, that is attains a value in its tail. For example if Lə is the loss 
of the financial system as a whole and Lı the loss of a firm in this financial system, then 
CoVaRq,g(L2|L1) provides the system's VaR given that the firm has a loss beyond its 
VaRg. 

A CoVaRa( Lə | L1) measures the contribution to the risk of Lə if Lı changes from the 
common scenario Lı > VaRos(Li) to the stressed scenario Lı > VaRa(Lı). In the 
aforementioned example, A CoVaRa(Lə | L1) measures the contribution of the firm's to 
systemic risk, that is how much the financial system's loss Lə increases because of the firm's 
stress. 


b) If (Z1, L3) ~ F with margins F1, Fo and copula C, then, for all x in the domain of F>, 


P(Li > FÉ (8), L2 < x) 
P(Li > Ft (B)) 


Fro|Ly>VaRg(L1)(£) = P(Lo € x | Lı > FÉ (8)) = 


_ P(L2 < z)-P(L < FF (8), L2 < x)  F»(x)- FFF (8), x) 
P(FAQA) = B) 7 1-8 
_ F(x) - C(B, F(a) 
1-8 f 


c) With C(u) = yM (u) + (1 — »)IHI(u), u € [0,1]?, where M and II denote the comonotonicity 
and the independence copula, respectively, and y € [0,1], we obtain from b) that 


_ PFY(z) — y min(B, Fo(x)} — (1 — 7) BFa(2) 
Fri S VeRa(AY (5) = 1- f 
i F»(x)(1 {ae B x F»(a), 
DERI. B > Pola), 
_ c ca T» Fi (B), 
Fo(z)(1— v), a < Fj (B); 
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note that this is a continuous function in z as the one-sided limits for x + F$ (8) coincide. 


For x = F3 (B), we have Fh o|£1>VaRg(L1) (2) = (1— y). By inverting Fr P VeRg(L (2) = 
a for a € (0, (1 — y)6] and for a € ((1 — y)8,1), we thus obtain that 


CoVaRa,s(L» | L1) = For svansqa) (9) H fe + 
2 


note that CoVaRa,g(Lə | L1) is continuous in a. 

For y = 0, we obtain CoVaR4,s(Lo|L1) = Fy (a) = VaRa(L2) for all 8 € (0,1) 
which confirms the intuition that Lı has no influence on Lə. For y = 1, one has that 
CoVaRo,a(L2 | L1) = F$ (6+(1-B)a) = VaR5 (1. go (L2), so for fixed a, CoVaR, a(L» | L1) 
can equal VaR(L2) at any confidence level in (o, 1) for a suitable 6 € (0,1) or, for fixed 
B, CoVaRo,g(L2| L1) can equal VaR(L2) at any confidence level in (8,1) for a suitable 
a € (0, 1). 

With PF5(x) = 1 — exp(—Az), x > 0, so F$ (u) = —log(1 — u)/A, u € (0,1), we obtain 
from c) that 


CoVaR,, a(L» | Lı) = | A 


Therefore, 


1 
CoVaRo,a(L2 | L1) = X los (5 = 


and 


l-a 
lc» 


1 
CoVaRao.5(L2 | Li) = RA log( 


For all a € (1/2,1) and (thus) all y € [0, 1], we obtain that 


A CoVaRa(L2| L1) = ; (s(; eO (5 =) 


(acqua. 


For y = 0, we obtain A CoVaRa(Lə| L1) = 0 for all a € (1/2, 1) which confirms the 
intuition that Lı has no influence on Ly. For y = 1, we obtain A CoVaRa(L»| L1) = 
— log(2(1 — a))/X for all o € (1/2, 1) which, for suitable o € (1/2,1) or A > 0, can take on 
any positive value. 
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Solution 8.21 (Superadditivity of VaR under worst-case dependence) Ex. p. 66 
a) By the Law of Total Probability, P(Lo < x) = P(Lo € zx, Lı < a) + P(Lo € z, L1 >a). By 

continuity, the first summand equals P(L; € x, L1 < a) = P(L4 € min{z,a}) = min{z,a} 

for all x € [0,1]. For the second summand, note that 1+ a — z > o for all x € [0,1], so that 


P(Lo < x, Lı >a) = P(1 +a — Lı < z, Lı >a)=P( >1+a—2,L; >a) 
= P(L; > 1 +a- x) = max{1 — (1 +a — x),0} = max{z — a, 0}. 


Therefore, 


x+0=rx, rao, 
Fr,(x) = min{z, a} + max(x — a,0} -| 
Aa+tr®-a=%, ra, 


=x, a S|0; 1}, 


so Lz ~ U(0, 1). 
Note. Alternatively, P(L9 € x) = P(Li € x) = z for x € a and P(Lo < x) = P(Lo € o) + 
P(a < Lg € x) = P(L1 € a)+P(a < 1+a-L € x) = a+P(Lı > 1-a—-z) = a+(x—a) = x 
for d > a 

b) Let o € (0, 1) and e € (0, (1 — a)/2). Note that 


214, I4 «a, 


RERE 
A B Li, 


and that 2a < 1 + o for all a € (0,1). Hence, for all x € R, the Law of Total Probability 
implies that 


Fy,41,(£) = P(Li + Lo < x£) =PQh < x, Li < a) + P(1 +a < z, Lı >a) 
= P(L1 < min(z/2,0)) + Iraca} Pa > a) 
= max(min(z/2, a), 0) + Isace)(1— 0) 


0, x <0, 

z/2, «x € [0,2a), 

Q, x € [20,1 +a), 
1, t> Ilta: 


Since £ € (0, (1 — a)/2), we have a + £ € (a, (1 + a)/2) and thus 
VaRa+e(L1 + Lo) = Ff p (& + £) = 1 +a > 2(a + €) = VaRa+e(L1) + VaRa e (L5), 


in particular VaRa+e is superadditive. 


229 


8 Solutions to Aggregate Risk 


c) Since L4 ~ U(0, 1) and Lə ~ U(0, 1), we have 
VaRote(S2) = inf [Fr (o e 2) - Fz!(1—2)) 


= inf {a+e+r+1-zr}=1+a+e. 
z€[0,1—o--e] 


Hence for € > 0+, VaRa+e(Lı + L2) = 1 + a is arbitrarily close to the upper bound 
VaRoa+e(S2); this is why we refer to the joint distribution of Lı and Lə as worst-case 
dependence. 


Ex. p. 66 Solution 8.22 (A proof of subadditivity of ES based on Hoffding’s Lemma) 
a) Since E(|L|) < oo, ES,(L) exists and we have 


1 1 1 1 1 
ESS(L) = Da FẸ (u) du = Ei F (ulf(uaj:l1du- EF E(FE (U)tuoj) 
-| RLY). 
l-a 


By definition, Y = Iyysq} ~ B(1,1 — a) and L and Y are comontone. For any other 
random pair (L,Y) with Y ~ B(1,1 — o) (so any other copula between random variables 
with marginal distributions Fr, and B(1,1 — o), respectively), Hoffding’s Lemma implies 
that cov(L, Y) € cov( L, Y) and so 


E(LY) = cov(L, Y) + E(L)E(Y) < cov(L, Y) + E(D)E(Y) = E(LY); 


note that the appearing covariances exist here since E(|LZ|) = E(|L| -|Z]) € E(|L]) < oo 
for any random variable Z with Z € [0,1] almost surely. Therefore, ES,(L) is the largest 
E(LY)/(1 — a) and thus 


ESa(L) = sup{E(LY) : Y ~ B(1,1—a)} 


l-a 
as claimed. 
b) For L; with E(|L;|) < oo, j € {1,2}, we have 


(1— a) ESa(Lı + L3) " sup{ E((E + L3)Y): Y ~ B(1,1— a)} 


sup{ E(LiY) +E(LoY) : Ý ~ B(1,1— a)} 
sup{ E(Z4Y) : Ý ~ B(1,1— a)} 


+ sup{ E(LoY) : Ý ~ B(1,1 — a)} 


5 Qe) ESa(L1) + (1 — a) ESa(L2) 


and thus that expected shortfall is subadditive. 


IA 


Ex. p. 66 Solution 8.23 (A basic version of the rearrangement algorithm) 
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a) To mimic comonotonicity (C = M) in the joint tail of (L1, L2, L3) the ranks of the entries 
in each column have to be aligned, so the smallest realizations of tail losses of L1, of Lə 
and of L3 have to lie next to each other, the second-smallest have to lie next to each other, 
and so on. One such configuration (the order of the elements in the first column does not 
matter) is obtained by sorting each column of A in increasing order: 


o E o ud 
Nawe 
o Hm Ne 


As smallest minimal row sum, the corresponding VaRa(L) estimate is thus VaRa(L) = 3 
(the sum of the first row). To mimic C = II, each column of A would need to be permuted 
randomly. 
Note. Randomly permuting the columns would only lead to VaRa(L) = 3 with probability 
(1/4)? = 1/64. With probability 1 — 1/64 ~ 98.44% one would obtain a larger VaRa(L) 
estimate VaRa(L). We thus see that VaRa(L) under comonotonicity can be smaller than 
VaRa(L) under independence; see also MFE (2015, Example 8.40). 

b) i) Iterating over the columns and rearranging one column at a time (oppositely to the sum 

over all other columns), we obtain 


23 8 13 8 1 18 
4 1 1 not stable 4 1 1 stable 4 5 1 
15 2| Jay i35 2 a |3 7 2 
374 rai) 274 (3) 234 

1158 118 

4 5 2 not stable 3.5 2 

2s coge es sq AOS a y aije 

r() 234 ra-(i) 43 4 


where »; .; denotes the row sums over all but the jth column. Since all columns of the 
last matrix can be verified to be oppositely ordered with respect to the sum of all other 
columns, the matrix does not change anymore in further rearrangement steps and thus 
this basic RA terminates. The vector of row sums of the last matrix is (10, 10, 10, 11) and 


thus the minimal row sum equals 10. We therefore obtain the estimate VaR4(L) = 10 
of VaRa(L). 

Note. When ties appear in 5, ;, a stable sort algorithm keeps the order of those values in 
the jth column which correspond to the tied values in 5; j; an unstable sort algorithm 
does not necessarily. The above snapshots of this basic RA indicate where we applied 
such a stable sort; other choices are possible, ours here were made based on how 
rearrange() in the R package grmtools proceeds for this example. 
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ii) In the case C = M, we obtained VaR4(L) = 3, so a much smaller VaR4(L) estimate. In 


particular, C = M did not lead to the largest VaRa(L) estimate, or, to put it differently, 
the dependence which leads to VaR4(L) is not comonotonicity. 


Note. Figure S.8.1 shows a sample of size 256 from C which leads to VaR4(L) for a 
Pareto, log-normal and log-gamma margin and a = 0.99. Clearly, the sample cloud does 
not come from the M copula. 


Figure S.8.1 256 realizations from the worst-case copula C of the joint tail of (L1, Lo, L3) for 
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Fi, Fo, F3 being Pareto, log-normal and log-gamma distributions, respectively, 
and a = 0.99. 


iii) In each step of the RA, the opposite ordering of columns leads to a smaller (not necessarily 


strictly but at least not larger) variance of the row sums. This aims at concentrating 
more of the 1 — a probability mass of the distribution of L = Lı + Lo + La beyond 
VaRa(L) on a single point (if it exists analytically, such as under the assumptions in 
the hint, this point would be ES4(L)) which aims at dragging out the current VaRa(L) 
estimate further to the right, so increase it; see Figure $.8.2 which shows different 
distributions of the 1 — o probability mass of L = Lı + L2 + L3 beyond VaRq(L). 

Suppose the assumptions of the hint to hold. If all row sums are equal, the variance 
of L| L > VaR4(L) would (approximately) reach 0 and thus L| L > VaR (ZL) would 
equal its expected value, so ES,(L). Also, VaR4(L) would be equal to ES4(L) in this 
case, the largest possible value (since VaRa(L) < ESS(L)) and thus in line with the aim 
of the rearrangement algorithm. 


Advanced 


T 
VaR, (L) ES,(L) 


Figure S.8.2 Visualization of concentrating more of the 1—a probability mass of the distribution 


of L = Lı + Lə + La beyond VaRa(L) on a single point by decreasing the variance 
of L through column rearrangements in order to increase the current VaRq(L) 
estimate in the RA. 


We have 
1353 E: 
222 $222] 
333/X-(i)uss 


which satisfies that all columns are oppositely ordered to the sum of all others. The vector 
of row sums of the last matrix is (5,6, 7) and thus the minimal row sum equals 5. We 


therefore obtain the estimate VaR4(L) = 5 of VaR4(L). However, this is not the largest 
possible VaR4(L) estimate VaRa(L) since the matrix 


1 2 
2 3 
3 1 


Ne C2 


has row sums (6,6,6) and thus VaRa(L) EDI. 
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Review 


Solution 9.1 (Mapping and loss operator) 

The mapping is the function which maps a vector of risk-factor changes to the corresponding 
value of a portfolio. The loss operator is the function which maps the vector of risk-factor 
changes to the corresponding portfolio losses. 


Solution 9.2 (Unconditional and conditional approaches to market risk measurement) 

In the conditional approach to market risk measurement, the distribution to estimate is the 
conditional distribution function of risk-factor changes Fx, ,,|z,, so the distribution function of 
the vector of the one-period risk-factor changes X;+ given all information F; = c((X,: s < t]) 
up to and including time t. We then compute risk measures based on /;(.X1,1) for Xi44 ~ 
Fx, |z} In the unconditional approach, the multivariate time series (Xs)s<t of risk-factor 
changes is assumed to be stationary and we estimate its stationary distribution function Fx 
to compute the (unconditional) losses via (X) for X ~ Fx. The unconditional and the 
conditional approach coincide if the multivariate risk-factor changes are iid. 


Strengths (‘+’) and weaknesses (‘—’) of the conditional over the unconditional approach are: 


+ Uses all the information available up to and including time t. 
+ A natural approach for market risk measurement. 
+ More in line with empirical properties of financial data. 


— Not always a prudent assessment of risk (may underestimate the possibility of extreme losses 
in quiet periods of financial markets). 


— Because of the lack of the stationarity assumption of the multivariate time series of risk-factor 


changes, the theoretical treatment of the conditional approach is typically more complicated. 


Solution 9.3 (Comparison of variance—covariance, historical simulation and Monte Carlo) 
We list possible advantages (‘+’) and disadvantages (‘—’) for each of the methods. 

= Variance-covariance method: 

+ The loss distribution function Fr,,, of L441 is known explicitly. 


+ Being univariate normal, F;,,, leads to explicit VAR4(L;4,1) and ES,(L441) estimators. 


— The assumption of multivariate normality of the vector of risk-factor changes X;41 is 
unrealistic. 
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+ The fact that Fj,,, is normal typically makes implementing this method straightforward. 
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— The assumption of the linearized loss distribution function F D to be a good approxi- 
mation to Fr,,, may be violated. 


= Historical simulation method: 


+ There is no estimation of the (unknown) distribution (dependence and marginal distribu- 
tions) of X,,, required since Fz, ,, is an empirical distribution function. 


+ Easy to implement since F7,,, is an empirical distribution function. 


+ Using the empirical distribution function of the past losses typically allows one to use the 
original loss operator (not a linearized one). 


— It only considers past risk-factor changes for evaluating the current loss operator, no 
future (or unseen) scenarios are included. 


— '[here is often also a lack of sufficient synchronized (joint) data which then leads to poor 
tail event estimates. 


— Full revaluation of the portfolio of interest is required for each historical vector of risk-factor 
changes, which can sometimes be computationally costly. 


Monte Carlo method: 


+ Quite general and flexible since any distribution for X,4,1 which can be easily simulated 
from can be used. 


— It is unclear how to find an appropriate model for X,,,. 


— 'The Monte Carlo method can also be time-consuming if the loss operator is difficult to 
evaluate (for example by simulation itself). 


Solution 9.4 (Historical simulation) 


a) 


Historical simulation uses the empirical distribution function of the losses as loss distribution 
function. The Glivenko-Cantelli Theorem says that if the losses are iid, their empirical 
distribution function almost surely converges uniformly (for all x € R) to the true underlying 
but unknown loss distribution function. 


Historical simulation is an unconditional method since, at time t, it estimates the loss 
distribution function as the empirical distribution function of lj (X543), -- -l IX?) which 
is a consistent estimator of the distribution function of jj (X) for the generic vector of 
risk-factor changes X ~ Fx. 

A univariate dynamic historical simulation consists of treating L; 44, = lig Xin); 
lt = lig (Xt) as stationary losses L, of the form L, = lUs + O5sZ5, 8 € {t—n+ 
1,...,t], where us and c; denote the (F;_1-measurable) conditional mean and volatility, 
respectively, and (Zs) ~ SWN(0,1); (us) and (os) can be estimated, for example, by 
fitting an ARMA — GARCH process to Ls, s € [t —n-F 1,...,t], and risk measures can be 
computed from the fitted (us) and (cs) based on a risk measure estimates computed from 
the corresponding standardized residuals. 

Note. A multivariate dynamic historical simulation consists of treating X, 441,..., X: as 
realizations of a process (X,) of the form X, = Hs + A,Z;, where (p) is a vector-valued 
process, (A,) for A, = diag(o.51,..., 0,4) is a process of diagonal matrices (both assumed 
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to be F,_;-measurable) and (Z,) ^» SWN(0, P) for a correlation matrix P; the components 
of (us) and (As) can be estimated by the conditional means and volatilities of fitted 
marginal time series models. The idea is then to apply historical simulation to the residuals 
ŽŽ, = ÂZ (X, — fis), s € {t— n+ 1,...,t}, and thus to treat L.— lig (fui + Asi Zs), 
sé€{t—n+l,...,t}, as observations of Dy41 = ljg (X+). 


Solution 9.5 (Elicitability and its relevance to model validation) 


a) 


c) 


Let o(Fr) :— o(L) for a law-invariant risk measure o. Computing a forecast of o( Fr) over 
the next time period is a point forecasting problem because Fr is unknown and one has to 


find a forecast Fy, of it to forecast the unknown true o(Fr) via the point forecast y = o( Fr). 


Since different Fr, can be used to forecast Q(Fr), we compare different point forecasts y 
with their forecasting error 


(SQ. D) = [ Sv.) aFL(), (81) 


where S(y, Ll) is a scoring (that is forecasting objective) function. For a given scoring function 
S, the optimal point forecast is 


o (Fr) = seu E(S(y, L)), (S2) 


so the y that minimizes (S1); for example, for S(y,1) = (y — 1)? and S(y,l) = |y — I|, the 
optimal point forecasts are the mean and median of FT, respectively. Elicitable risk measures 
are risk measures o which can be written in the form (82) for some scoring function S. 


The existence of a scoring function S for an elicitable risk measure o implies that this S 
can be used to compare different point forecasting procedures for o, so a decision based 
on 'the smaller the forecasting error, the better' makes sense. This allows one to compare 
forecasts yi, y2 of a risk measure o based on realized losses Li,..., Ln (for example via the 
approximation m S(yz, Li), k € {1, 2}). 

Note. If o is not elicitable, one cannot find such an S and thus the minimization of the 
forecasting error does not yield the true value o(F7) for any S. Hence, for two competing 
point forecasts of o( Fr), one cannot tell which performs better by comparing their forecasting 
error, no matter what S is used. 


VaR: yes; ES: no; expectile: yes. 


Solution 9.6 (VaR violations) 


a) 


b) 


The realized loss at time t is a VaR violation if it exceeds the estimate of VaRq computed 
from data up to and including time t — 1. 

For the ideal forecaster the time series of violations would be an independent process of 
Bernoulli variables with success probability 1 — a. 


Solution 9.7 (Factor models for bond portfolios) 
In the Nelson-Siegel approach the factor model is fitted to the yield curve data at time t. The 
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coefficients of the three factors have a prescribed functional form with a tuning parameter. The 
factors themselves are estimated by a cross-sectional regression using weighted least squares 
with an extra optimzation over the possible values of the tuning parameter. This is repeated at 
every time point t to get time series of the factors which can then be used to forecast changes 
in factor values in the future. 

In the PCA approach we begin with data on the changes in yields. PCA is applied to 
multivariate longitudinal time-series data, rather than cross-sectional data. The technique 
of PCA is used to construct sample principal components of the yield-change data; see also 
Exercise 6.32. The first few of these (usually three) are taken to form the time series of factors 
and these are modelled and used to explain changes in the yield curve over time. 


Basic 


Solution 9.8 (Mapping an interest-rate swap) 
a) Set t= 0 and V(0) = 0 to obtain 


4 (p(0, To) — p(0, Tn)) 


K = - 
Mii v(t, Ti) 


from the given formula. 


b) The IRS is a long-short portfolio of zero-coupon bonds and so we may use the detailed 
mapping of a bond portfolio in MFE (2015, Section 9.1.3) to obtain 


V; = g(Ti, Zi) -5 Ai exp( — (1i — 72) Zt), (S1) 


where Zii = y(t, Ti) is the yield of a ZCB with price p(7, T;) (the ith risk factor here) and 
—], i — 0, 
Ai = 4 K/A, i € (L,...,n— 1}, 
(4+ K)/A, i=n. 


Note that (S1) has the same form as MFE (2015, equation (9.12)). 
c) Since g(t, zt) = 35-9 Ai exp(—(Ti — 7%) 2,4) with 
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we obtain from MFE (2015, equation (9.7)) that the linearized loss operator is given by 


Hc) má (a(n z) At + 5 Jz: (Tt, 2) 
1=0 


—À ( 5 Ai exp(— (i — Tt) zti) zti At + 5 —(T; — n)Aiexp(- (T; — ys) 
i=0 i=0 


= — M Xexp(- (Ti — 74) 244) (24At — (Ti — )zi) 
i=0 


in terms of a risk-factor change x; compare with MFE (2015, equation (9.13)). 


Solution 9.9 (Mapping a currency forward) Ex. p. 70 

We set the nominal of the contract as N — 1. 

a) At time T' the holder of the long position in the forward has to pay e EUR; in return he 
acquires a position of one USD; the market value in EUR of this USD position is er. Hence 
the market value in EUR of the long position in the FX forward is Vr = er — e. 

For the second part note that the value in EUR at time T' of a portfolio consisting of 
one foreign and e domestic zero-coupon bonds equals Vr — ep — e and is thus equal to the 
payoff of the forward. 

b) The vector of risk factors consists of the logarithmic USD/EUR spot exchange rate log ei, 
the continuously compounded yields y'(r;, T) = —(1/(T — t)) log p'(r;, T) and yd (v, T) = 
—(1/(T — t))logp?(r;, T) of the EUR and the USD zero-coupon bond and is given by 
Z, = (log ei, y!(ri, T), y* (1, T)). The value of the position in the USD bond then equals 
Vi = exp(Zii — (T — 7))Zi2) and the value of the position in the EUR bond equals 
VA = €exp(—(T — 7%) Zia), which gives the mapping 


V; = Vi — V8 = exp(Zui — (T — 0)Z432) — eexp( -(T — Ti) Zt 3) =: (ns, Ze). 
c) Taking derivatives we get 
9. Qr, Zt) = Vf Za — Vf Zi, 
9a (Te, Zt) = Ve, 
(Te, Ze) = -(T— n) Vé, 
(Te, Zi) = (T = r) V. 


Gzo\Tt, Zi 


za Tt, Zt = 
The linearized loss is thus given by 


3 


Lb —— (s(n Z,)At + 5 Jz; Tt, Zi) X15) 
= 


= -((Vf Z2 - V$ Zus)At + Vf Xia — (T — T) Vf Xe + (T — n) VE Xii). 
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Ex. p. 70 Solution 9.10 (Asset liability management using duration and convexity) 


Ex. p. 71 


a) 


Denote by Vo = 1M 5.,€- 9?T the value of the liabilities at t = 0 and by D = 
—0.01T 


yy T their duration. The nominal values A; and A5 have to solve the equations 


—0.01 —0.05 
Ae 994 Ape OO = Vo and AE 5 ave — D. 
Vo Vo 


This is a (nonlinear) system of two equations for A; and A5 which is easily solved numerically. 
The solution is A1 7: 2.48M and A5 z 2.53M. 


Substituting into the formula defining convexity, see MFE (2015, equation (9.17)) we obtain 
a convexity of 10.9 for Strategy 1 and of 12.9 for Strategy 2. 


If the manager uses perfect matching (Strategy 1) the change in the overall portfolio value 
is always equal to zero, regardless of the yield-curve scenario considered. For Strategy 2 the 
change in value is positive in both scenarios: numerical computations show that if yields 
increase to 2%, the portfolio value increases by 471, and if yields drop to 0.1%, the portfolio 
value increases by 404. 


This conclusion is wrong: there are scenarios (yield curve changes) where Strategy 2 
generates a loss. A simple example is the case where y(0, 1), ..., y(0, 4) stay unchanged but 
y(0, 5) increases. Since, under Strategy 2, the combined portfolio (assets and liabilities) has 
a long position in p(0,5). 


Solution 9.11 (Variance-covariance method for multivariate t risk-factor changes) 


a) 


b) 


In contrast to the multivariate normal distribution, the multivariate Student £ distribution 
allows for more flexibility in the tails, in particular, it can be leptocurtic and capture tail 
dependence. 


Since, by Exercise 6.20 a), Ag + AX ~ talv, Ao + Ap, XZA), we have 
VaRa(Ao + A.X) = Ao + N+ V A'XAt, (a), 


VAPA V fa (t (0) tz (of), 


l-a v- 


ES4(Ao + AX) = 2o + A! 


where t, + denotes the quantile function of the standard Student t distribution with v degrees 
of freedom and density f+; see Exercise 2.16. 


The annualized volatility ca is the standard deviation of an instrument’s yearly log-returns. 
Motivated from the square-root-of-time rule (which is a matter of convention that is not 
supported by probabilistic arguments), the daily volatility øq is obtained by taking o,//250. 
The daily volatilities of the log-returns X41 = (Xt41,1, Xt41,2) at day t+1 of the two stocks 
are thus 9441 = 01/ /250 and 0432 = 02/ V250, respectively. Therefore, their covariance 
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matrix is 
5 .04 0.4-92...0.25 
" e eos 741 — Panda — 250 "^ 250 V250 
Xy TC t+1) = A paco os < = 4.0.2. 0.25 0.0625 
dou did 0.47350 250 250 


_ 1/16 8 

~ 108 \ 8 25)" 
We know from MFE (2015, Example 6.7) that Xx,,, = ;45» for the parameter (covariance) 
matrix X of the multivariate t distribution of X;,1. As such, 


y—2 2 [16 8 
X= y Pain ge (5 ae 
For this X, we thus have that Xt41 = (Xii, Xt41,2) ~ t2(v,0, £) for v = 5. As in 
Solution 2.10 and by Exercise 6.20, the daily one-period linearized loss random variable is 


Ley = —Vi(wi Xia + wasXiuia) = -Vw Xe ~ t(v, -Viw'0, (V) w Ew) 
= t(v, 0, VZ w £w), 


where w = (wi, w2) = (0.7,0.3) are the given portfolio weights. With V, = 1 it follows 
from b) that 


VaRoss(L,) = 0+ / V2w'Xw t; (0.99) = 30228.24, 


X V2w'Xw 
ESo.99(Li41) = 0 + 


- “ ; fiu (ty (0.99) (1. + tz (0.99)? /v) ~ 39997.59. 
d) They are of course consistent! 


Solution 9.12 (Sampling from an empirical distribution function) 

For X ~ Fn, P(X = Ley) = Falle) — Fall) = Fa(Lgy) — Fn(Eq-) = (k/n) — (k — 
1)/n) = 1/n for all k € {1,...,n}, so Fn puts mass 1/n on each of L4,..., Ln; note that this is 
still true if there are ties among L,..., Ln. Drawing from Fn thus corresponds to drawing 
each of L1,..., Ln with equal probability 1/n with replacement and thus randomly drawing 
from L4,..., Ln with replacement. 

Alternatively, we can also utilize the inversion method. We know from MFE (2015, Sec- 
tion 9.2.6) that Ff (u) = L(,4]), so by the inversion method, X = F; (U) = Linui ~ Fn 
for U ~ U(0,1). Note that [nU] ^* U({1,...,n}), so that X is a randomly drawn loss from 
Lea), ..., Lin) and thus from Ly,..., Ln. 


Solution 9.13 (Square-root-of-time rule for serially independent risks) 
a) By independence, L4 +--+ Lj ~ N(hu, ho?) and so 


VaRa(Li +--+ Lp) = hu + V ho29-! (a) = hu + V hoc! (o) (S1) 
= (h — vh)u + v'h(u- a (o) 
= (h — V h)u + Vh VaRo(L1). (S2) 
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ind. 


b) Let R ^" N(0.01, 0.04), t € {1,...,9}, denote the daily returns of the stock per share. The 
ind. 


corresponding losses based on 100 shares of the stock are thus L; = —100R, ^ N(—1, 400), 
t € {1,...,9}, with corresponding VaR4(L;) = —1 + 209-!(o). By a), 


VaRo(Li +++: + Lg) = (9 — V9)(—1) + V9(—1 + 20871 (a)) = —9 + 609 1(o). 


Alternatively, note that $2? 4 L; ~ N(9- (—1), 9-400) and thus VaRa(Lı +---+L9) = 
—9 + /9-4009-1(a) = —9 + 606-7! (a). 
c) By the Central Limit Theorem, approximately for sufficiently large h, 
1 h 
Sha Le — hype | pp Dre Lt) — y ~N 


ho? v o? 


and so, approximately for sufficiently large h, we have ERN Li ~ N(hp, ho?) and thus 


(0, 1) 


VaRa(L4 +--+ Ly) © hu + Vho® (a). 


Note. This coincides with (S1) but in general not with (S2) as Lı is not necessarily normal 
here. 


Solution 9.14 (Square-root-of-time rule for serially dependent, jointly normal risks) 
If L~ Ni(p, 3), Li +--+ Lj = VL ~ N(1'p, 171) and thus 


VaRa(L4 +- + Ln) = 1u + VV/E18 (a). 


It follows from L; ~ N(u, 0°), t € {1,...,h}, Yu = hu and 1'31 = o° 1 P1 = o?(h + h(h — 
1)p) = o7?h(1 + (h — 1)p) that 


VaRa(Li +++ + La) = hu + ov h(l + (h — 1)9)87 (o) 
= hp + ey h( + (h—1)p)((VaRa(L1) — 1/2) 
= (n- yh(1 + (h — 1)p))u + Vh + (h — 1)p) VaRa (Li). 


Solution 9.15 (Tests based on VaR violations) 
a) One would expect about 500(1 — 0.99) = 5 violations of VaRo.99, that is losses exceeding 
VaRo.99.- 


b) One could use a two-sided binomial test with test statistic T5090 = yo I L, 11» VaRÉ 99}? 
see MFE (2015, Section 9.3.1). Computing the p-value of this test (for example with R's 
binom.test()) leads to approximately 0.000646 as p-value and thus to a strong indication 
against the null hypothesis of having accurately estimated VaRo.99; see qrmtutorial.org. 
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c) The VaRo.99 violations build a sequence of Bernoulli trials. If the violations are iid, the 


spacings between them should be iid Geo(p) (on N) for p = 0.01. Checking this with a 
Kolmogorov-Smirnov tests based on the realizations 7, 54, 9, 5, 15, 40, 70, 45, 122, 4, 14, 
18, 3, 81 leads to a p-value of 0.0165; see qrmtutorial.org for this and more results. 


Solution 9.16 (Elicitability and comparison of VaR estimates) 


a) Figure S.9.1 displays graphs of SZ(y,l) as functions of l for y= 1 and a € {0.5,0.9}. For 


1 


Score function Say, I) at | for y 


Loss level | 


Figure S.9.1 Graphs of the score function S2(y,l) for VaRa(L) for y = 1 and a € {0.5,0.9}. 


a = 0.5, S2(y, l) = 0.5|l — y|, which is symmetric about y. For a = 0.9, however, $2 (y, l) is 
larger for | = y + h than for the corresponding point l = y — h for any h > 0 and so losses 
L > y are punished more than losses L < y. To minimize the prediction error (expected 
score), it is thus optimal to shift the prediction y to the right. 


We have 


12 12 
t 1 t 
12 J S§.99(VaRo 9o x; Lipi) = 0.0537 > 0.0508 = 12 J S§.99(VaRo.99 v, La), 
t=1 t=1 


so indeed the smaller forecasting error is produced by the more adequate model. 


Note. In real-world applications, the number of losses (12 here) should, of course, be much 
larger to be able to draw reasonable conclusions. 
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Advanced 


Solution 9.17 (Delta and delta-gamma approximation) 
a) The first-order Taylor series approximation of g(yi1) about yo is given by 
g(y1) = (yo) + Vg(yo)' (yı — yo). 


Therefore, —(g(y1) — g(yo)) = —Vg(yo)'(y1 — yo). With yı — yo = (At, a1,... 
Vg = (gr, Ja, .. es) we obtain that 


—(9(y1) p g(yo)) = — (9r (Tr, z), 92 (Te, z), p , Gas Tt; 2)) (At; Tl,» sa) 


= —(g7(%, z) At + 6(%, ZY £), 


where (Ti, Z) = (gz, (i, z),... 
this implies that 


LÀ, = -(g(Yi) - 9(¥o)) = — (9r (Te, Zn) At + (n, ZO) Xo). 


b) In terms of the Hessian matrix H — (x25 atta 
109j , 


mation of g(y1) about yo is given by 


g(yi) = 9(yo) + Vglyo) (yı — yo) + (y — yo) H (yo) (yı — Yo) 


and thus —(g(y1) — 9(yo)) = —(Vs(yo)'(yi — yo) + $i — Yo)’ H (yo)(yi — vo))- 
Yi — yo = (At z1,..., xq), Vg = (95.925... 94,) and 
Grr Grey Irza 
H = 4 nd E * Pues 
Izar Jzazı Gzaza 
we obtain that 
1 
gn — qo) H (yo)(y1 — wo) 
GAGE z) Jrz (Ti, z) e Tz (Ti, z) 
- PAS. a) gar (rt z) Jaz ("es z) ‘ . Susa (Tt z) E 
: : : c 
DN Tes z) Üzaqz (Te, z) oe Izaza Tes z) 


a! Grr (Tt, z) 
(At, 2) ee 


wln, zy \ [At 
T(m, z) x 


(Grr(7, z)(At)? + x'w, z)At + Atw(n, z) x + £T, z)a) 


NIe NIe HI-A|A 


(Grr (Ti, Z)(At)? + 2w(7, zm At + a T (ni, z)æ) 
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|a, (Tt, Z)). For Yo = (t, Z+) and Y = (44+ At, Zi + X+41), 


the second-order Taylor series approxi- 


Advanced 


where w(71, 2) = (ga c(Tt; Z), ++ +5 Gear (Tt, 2)) and T(m, z) = (25 ani. 2) Therefore, 
— (g(u1) — g(y0)) 
1 
= (s(n 2)At + (ni z)'a + z (ge (ri 2) (At)? + 2 (ni, z) At + wT (re, 220 


For Yo = (m, Zi) and Yi = (% + At, Z + Xi44), this implies that 
Li, = —(9(¥1) - «(Yo)) 


Ex EE (s(n, Z,)At + O(t, Zi) Xia 
1 
T 2 (ges Cre Zi) At)? + 2w(Tt, Zi) Xa At + Xl (ms, 2)X«)) l 


Solution 9.18 (Standard methods for market risk) Ex. p. 74 
R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 9.19 (Standard methods for an option position) Ex. p. 74 
a) Solving 0 = ASo — PPS (ro, S5; r, oo, K, T) = + 942.46 — PPS(0, 942.46; 0.01, 0.3018, 1050, 5) 

with respect to A, we obtain A œ~ 0.3030. 
b) With Z = (441, Zi2) for Zi1 = log(S$r1), Zio = ex and with X;,1 = Z41 — Zi, we have 

that 

VW = AS, of PP Cay St; T, Ot, K, T) 
= Aexp(Zi1) — PPS (m, exp(Z+1); r, Zuo, K, T) = g( Ti, Ze). 
We know from MFE (2015, Sections 9.1.1 and 9.1.2, equations (9.7) and (9.9)) that the 


loss operator liy (@) = —(g9(Te+1, Zt + £) — g(Te, z:)) leads to the linear and quadratic loss 
operators 


If} (a) = —(97(Tt, zi) At + (ri 24)'2), 


1 
If (a) = —(9¢(7 zr) At + OlT, Z)'a@ + gt Tn, xa), 


where, for At = 1/250 and s;1 = e, 
Or (Tt, 2) = -PPS At = — PPS /250, 


Gz, (TH, z Ae?tà — PBS 67,1 A — PBS) e%t1 A— PB®)s 
rca (al) = (gO) OAPI), 


o 


T'(7; ze) = Jaz (Tt, zt) Jazo (Tt, Zt) E —(e**1)? PBS + (A - PE95)esa —PBSena 
Jez (Tis zt) Üzozo (Ti, Zt) — PPS e% — PBS 


2 pBS BS BS 
E — 841 PSS Tuc — Ps”) se P5 $41 : 
’ 
=e St,1 —P. 
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note that as the time horizon of interest is one day and the time unit in the Black-Scholes 
formula is in years, we have 7; = t/250 and thus 7; = 1/250 = At. The Greeks — PBS 
(theta), PBS (delta), PBS (vega), PBS (gamma), PBS = PBS (vanna), PBS (vomma) can be 
derived from 


PBS (7, Sr, Op K,T) = Ke" (7-70 (—d, 5) — S,®(—di1) 
where 


. log(Si/ K) + (r + o2/2)(T — m) 
d ci T — Ti 


l K — 2 — 
di = osts Ht s gil REST S dii — oT — m. 
t — NT 


A rather tedious than complicated calculation shows that 


theta: —PPS 2 p Ke-' (7-70 ( d, 5) 


DT =m 
delta: pre = —$(-—di1), 


vega: PPS = Sildi) VT — m, 

gamma: PBS = é(dii)/(SioiV/T — m), 

vanna: PBS = — (d; 1)dia/oi1, 

vomma: PBS _ Sio (di, )VT- Se, 
t 


which fully determines the linear and quadratic loss operators. 


Note. As examples, we derive the delta and the vega here. For simplicity, let a prime denote 
the derivative with respect to the variable under consideration. For the delta, we have 


PRS = - Ke T- 6(—dy 9) di, 9 — (B(—di1) — Sepl—di1)d1) 
= —9(-dq) — (Ke T- p(—di 2)d a — Sió(—de)d,). 


Since d; > = d4, we have 


Pee = —®(—d;1) = (Ke "(7779 b(—dy 3)d, 2 Sib(—dt,1)d} 1) 
L1 da diy d? 
= $( di1) R 2 (Ke r(T Tt) e 2 Ox t. St) 


d2 
E a. 


By using that $ (dr, — di,2)(di,1 + de2) = jory T — T2 SOLU) — tog (5,/K) -r(T — 


Tt), we obtain that 


/ a? 
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For the vega, we use that 


; o(T — n)a T =T — (log(Si/ K) + (r + 07 /2)(T — ))/T —m 


"e o7(T — m) 
log(.S;/K) + (r + o2/2)(T — t) dta 
— ST = ST — oe 
" of /T =t a Ot 


and that di. = di, — VT — 1% = —di4/o, to see that 
t2 = 41 , 


PBS = _Ke"(?-) 6(—dro)di.o + Sud (—de1)dz = a + $16(de1) di, 
= Ke 2-8) e(d, J% 


T—7-— 


= Soldi) V T — m + neci = d 

= $:6(dt1) VT — m + —— à 2 Ke" (0770(d,i — cuv T — n) — Sio(di)) 

= Sió(di) T p — [Ree (dtr — Otv T — 71) sı) i 
t 


ó(di,1) 
Since 
(dea — oT — t) = i ((dt 1—0 T Tt)? d&a) — edis: T-ri o7 (T—n)/2 
oldi) 
— el8g(St/K)+(r+02/2)(T-r:)-0?(T=r:)/2 _ St erT-n). 
we obtain 


d $, 
PP = Sid(di i) VT — m + <= (de) (ke PTa Sree) t K -&) 
t 
d 
= weld) VT — nu Alda) Sr — St) = SeO(dt) VT — m. 
t 


c) R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 


Solution 9.20 (Elicitability of the exponential risk measure) Ex. p. 75 
From the hint, we have that 


S(y,l) = rA s(m, I) dm = [m + (1/a) exp(a(I — m))]? = y + (1/o) exp(a(l — y)) — (l + 1/a) 
= y—1+ (1/a)(exp(—a(y - 0)) — 1). 


We now check that this is a valid scoring function according to MFE (2015, Definition 9.6): 
i) S(y,l) 20, y,l € R, and S(y,l) = 0 if and only if y = l; 
ii) S(y,l) is increasing for y > | and decreasing for y < l; 
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iii) S(y,l) is continuous in y. 

Concerning i), note that (x) = x + (1/a)(exp(—ax) — 1) is differentiable with $'(r) = 
1 — exp(—az) and (x) = aexp(—az) > 0 for all x € R. Therefore, 9 is strictly convex 
and attains its unique global minimum 0 at x = 0. This proves i). For ii), y > S(y,l) is 
differentiable with 


d 

dy? U^ l) = s(y, l) =1- exp(a(/ m y)), 

which is negative for y < l and positive for y > l. Finally, as a composition of continuous 

functions, S(y,l) is continuous in y and thus iii) holds. 
Next, we show that oẸ®(L) is elicitable. First, for L with E(exp(aL)) < oo (the domain 


of oP), it holds that E(S(y, L)) < oo for all y. We have to show that for such L, oc*P(L) 
minimizes the mapping y +> E(S(y, L)). To this end, note that 


a 
dy 


E(S(y, L)) = E(s(y, L)). 


By (El), y > E(s(y, L)) = 0 if and only if y = o&P(L), that is for y = o&*P(L) the first-order 
condition i E(S(y, L)) = 0 is satisfied. By the strict convexity of 5, this first-order condition 
is also sufficient to guarantee that o£*P is a global minimum of y +> E(S(y, L)), which shows 
for the scoring function S. 


the elicitability of o&*P 


a 


Note. A similar argument may be used to show that any risk measure based on a strictly convex 
loss function is elicitable, which is the ‘only if? part (necessity) of MFE (2015, Theorem 9.10 (a)). 


Solution 9.21 (Accuracy of the empirical quantile estimator) 


a) Let x € [0,n) be fixed. We show the relationship between the binomial and the beta 
distribution by showing that the derivatives F'(p) and G'(p) of F(p) = Fy(a;n,p) and 


G(p) = G(p; |x| + 1,n — |x|), respectively, coincide for all p € (0, 1) and that F(0) = G(0). 
Note that F(p) — pu (p^ (1 — p)”-*, so 


Ed ET 
F'(p) => K kp (1 — p^ ^ - 7 H (n — k)p"(1 — p)” =*= 


i k=0 
- Lz] 
"à (, - ) (n—k+ Up 0-9" ^-» 9 (n — k)pk(1 — py-*3i 
k=1 mr 
[z|-1 " 
=F (y ga cam (Janene 
k=0 mn 
= zn (n [Dope |z | !(r Tu — D Spe e 


pl#l(1— pe-la- 
blz] - 1n — |z]) 
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where we interpret the empty sum as 0. Furthermore, 


F(0) = o o E "E at [ EL 2 u)^-lz] 1 e 


*tln-|z|) 
b) Let N = 577.4 Iyu,<u} and note that N ~ B(n, u). Then 


Fug, (u) = P(Ug € u) 2 P(N > k) = Fy(k — 1;n, u) S G(u; k,n — k 4- 1), 


a 


c) Since X = Fi (Ug), we obtain that 


Advanced 


G(0). 


u € [0, 1]. 


Fx y (2) = P(X) € x) = P(X (Uu) € £) = PUR) € Fx(x)) = G(Fx(z); k,n — k +1) 


and thus Fx ay (0) = FY (G* (a, k,n — k + 1)). 
d) According to (E1) for u = Uy, 


Xy = qx (U) ~ ax (605) + dx (005) (Ug — 905). 


Since the Beta(a, b) distribution has mean a/(a + b) and variance ab/((a + b)?(a +b + 1)), 


we have that 6(,) = EU(,) = k/(n +1). It follows that 


and 


var( Xi) © dx(k/(n + 1)? var(Ug;) = d (E/(n + 1)) 


9 k(n—k-4 1) 


E(X) © qx (605) + ax (60) EU) — 005) = ax (605) = ax (k/(n + 1) 


Solution 9.22 (PCA factor model for US zero-coupon bond yields) 


(n+ 1)?(n 4-2) 


R code accomplishing this task can be found on the QRM Tutorial website qrmtutorial.org. 
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10 Solutions to Credit Risk 


Review 


Solution 10.1 (Management of counterparty risk) 

Counterparty risk refers to the risk that one of the partners in a financial transaction (such 
as an OTC derivative trade) defaults during the transaction and is thus not able to fulfill his 
commitments in full. 

Challenges in managing this risk class arise from the fact that the size of the counterparty 
credit exposure is not known a priori and it is not even clear a priori which of the parties has 
the credit exposure (this might change with the economic value of the underlying transaction). 

There are various mutually non-exclusive approaches for managing counterparty risk. First, 
counterparty risk has to be taken into account in pricing and valuation of derivatives; this 
has led to the notion of credit value adjustments (CVA). Second, counterparty risk can be 
controlled using risk-mitigation techniques such as exposure limits, netting and collateralization; 
further details are given in MFE (2015, Section 10.1). 


Solution 10.2 (Bond investment under decreasing credit quality) 

A possible strategy is to take a protection buyer position in a CDS on A and to cancel the 
position after the spread of A has risen by taking a protection seller position as well. The profit 
is the positive spread difference. Risks associated with this strategy are counterparty risk of 
the CDS positions and the risk of wrong expectations on the development of the spread of A. 


Solution 10.3 (Rating transition matrices) 
By row, the missing numbers are 0.05, 0.7, 0, 0, 1. Squaring the matrix we get 


0.80 0.15 0.05 0.80 0.15 0.05 0.655 0.225 0.120 
0.10 0.70 0.20 | x | 0.10 0.70 0.20 | =] 0.150 0.505 0.345 
0 0 1 0 0 1 0 0 1 


and the two-year default probabilities of A-rated and B-rated obligors are 0.12 and 0.345, 
respectively. 


Solution 10.4 (Credit risk in the Merton model) 

a) In Merton's model default happens in T if Vp € B, where Vr denotes the value of the 
firm's assets at time T'. The value of equity at the maturity date T' is therefore equal to 
Sr = (Vr — B)*; this is the payoff of a call option on Vr with strike price B. For the 
liabilities it follows that Br = min(Vr, B) = B — (B — Vr)*; this is the value of a risk-free 
bond and of a short position in a put option on Vr, both with maturity T. 
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10 Solutions to Credit Risk 
b) The default probability in Merton’s model satisfies 


$ 


log(B/Vo) — (uv — THBIT) 
ay v T 


see MFE (2015, equation (10.6)). Substitution of the given parameters gives 


P(Vr < B) = P(log Vr € log B) = a( 


log(1/2) — (0.1 — 0.04/2) 
0.2 


P(Vr < B) = a( ) = 5.5377 - 1075. 


c) Strengths of the Merton model include that it allows for a clear economic interpretation of 
the default event and that the calibration to economic quantities such as balance sheet data 
is possible. A weakness of the model are the unrealistically low short term credit spreads 
produced by the model; see MFE (2015, Section 10.3.2). 


d) As explained in a) the payoff of the risky debt at the maturity T' equals the nominal value 
of the debt B minus the payoff of a European put option on Vp with exercise price equal 
to B. The price of the debt in t < T is thus given by 


B; = Bpo(t, T) e Poe, Vir, ov, BT), 


where PPS(t, V: r, oy, B, T) denotes the Black-Scholes price of a European put with strike B, 
maturity T on (V;) for given interest rate r, and volatility oy; see MFE (2015, Section 10.3.2) 
for the exact formula. Now the term Bpo(t, T) is independent of oy and it is well known 
that the value of a put option in the Black Scholes model is increasing in the volatility of 
the underlying, so that the value of the risky debt is decreasing in oy. This explains why 
bondholders have an incentive to prevent a firm from undertaking risky projects (which 
might increase asset volatility oy), even if these projects are on average profitable. 


Solution 10.5 (Market-based credit risk management and procyclicality) 

'The default probabilities estimated by such a system are typically fairly low in times of expansion 
where stock markets rise and high in recessions when stock markets fall. This means that banks 
face high capital requirements for credit risk in recessions and low capital requirements during 
boom times. This provides an incentive to shrink lending activity in recessions and to expand 
lending in boom times. If such a strategy is followed by many banks, economic cycles might be 
exacerbated. 


Solution 10.6 (Limitations of risk-neutral pricing) 

If properly applied, risk-neutral pricing leads to arbitrage-free prices of credit-risky securities, 
which are consistent with prices quoted in the market. This makes the risk-neutral pricing 
approach the method of choice for valuing credit instruments when the market for related 
products is relatively liquid. On the other hand, since risk-neutral pricing models have to 
be calibrated to prices of traded credit instruments, they are difficult to apply when we lack 
sufficient market information. 
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a) Under the traditional pricing methodology a loan is taken on the balance sheet if the spread 
earned on the loan is deemed by the lender to be a sufficient compensation for bearing the 
default risk of the loan. Here default risk is measured using the real-world measure and 
historical (default) data. Such an approach is well suited to situations where the market for 
related credit instruments is relatively illiquid (or non-existent). 


Given these considerations, the retail loan would be priced using traditional loan pricing. 


b) The option to enter into a CDS contract is a typical credit derivative that would be priced 
via risk-neutral pricing in an arbitrage-free pricing model that is calibrated to observed 
CDS spreads. 


Solution 10.7 (Models with stochastic hazard rate) 

In models with deterministic hazard functions the only risk factor affecting a defaultable bond 
or a CDS is default risk. Hence in these models prices of bonds or CDSs and credit spreads 
evolve deterministically prior to the default 7. Hence the distribution of the value of the 
product at some T > t is a two-point distribution; one point corresponds to the value of the 
product if 7 > T, the other to the value in the scenario where 7 < T. This ignores completely 
variations in credit spreads which are a key driver for the market risk of bonds or CDSs. 


Basic 


Solution 10.8 (Risk-neutral valuation) 


a) The expected payoff is (1.025)-1(0.99 - 1 + 0.01 - 0.4) © 0.97 and hence bigger than pi(0, 1). 


b) The risk-neutral default probability q solves 
p1(0,1) = (1.025)! (1 — q) - 1 + q- 0.4). 
It is uniquely determined and equal to q — 0.025. 
c) We get from the risk-neutral pricing rule that the price Vo is given by 
Vo = (1.025) !((1 — q) -0 + q- 1) = (1.025) 10.025 = 0.0243. 


Denote by £4 and £» the position in the defaultable and the default-free bond. The replicating 
portfolio solves the following linear equations 


£1 -0.4 + £2 1.025 = 1 (default), 
&1 - 1.0 + ¿2 - 1.025 = 0 (no default). 


From this we obtain £4 = —1.6667 and £» = 1.6260. 


Solution 10.9 (Estimating transition probabilities for a discrete-time Markov chain) 
Conditional on N;; the numbers migrating to each state k have a multinomial distribution, so 
that 


lij! ley hi X 
P(Nijo = lijo, ..., Nejn = lain | Nij = lg) = ae cup. lyn = ly 
tjo: tjn: k=0 
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10 Solutions to Credit Risk 


leading to the log-likelihood 


T—1 n 
log L = log L((pjx) | (Ni), (Nijk)) 5 es y? (ost Nu!) + Y Nikl log Pjk — log) : 
t=0 j=1 k=0 


We maximize the log-likelihood with respect to each parameter pj; subject to the constraints 
that M75-opjk = 1 for j = 1,...,n. The Lagrangian is 


log Le (ops - 1). 
k=0 


j=l 


T—1 Nüx 
t=0 Pjk 


Differentiating it with respect to pjg gives the equations 5; + Aj = 0, leading to 


2 Nijk = —Àjpjk: 


m 


ges 
Zu) 2- Nc ey Dors- 
t=0 k=0 


Hence, as claimed, the maximum likelihood estimator of pj; is Pjk = SE Nijk/ S Nij. 


Solution 10.10 (Default probability in Merton’s model) 
The default probability in Merton’s model is given by p = (I) where 


= mo E £ 
pa 08 Blog Vo - (uv —ov/2)T _ 1 log(B/Vo) — nvT | Lov. 


ay v T ov VT 


Since 6 is strictly increasing, the default probability p is strictly increasing in oy if and only if 
a 0. We get 


9 , HvT-tlog(Vo/B) | 1 VT 
Ooy / Por, 2 : 
which is clearly positive if Vy > B and py > 0. 


Note. The conditions Vo > B and py = 0 are sufficient to guarantee that 35 sol > 0 but not 
necessary. In fact, there are parameter combinations with Vo « B or with uy < 0 such that 
sol > 0, but it is also possible to find parameters such that aI <0. 


Solution 10.11 (The Pareto survival model) 
a) The density of the Pa(a,«) distribution is 


d — eye -k Qa ix Qa -= 
J0)— FU) = (I) eim xar vg pes 


Hence q(t) = f(t)/ F(t) = a/(K +t). Clearly, this expression is decreasing in t. 
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b) Let fa denote the density of A. Since (7| A = A) ~ Exp()), 
P(r » t) zi Besi[Ae3gaQ) det 
0 


= ke oo ai -X(&H) g e Ko Tr(a) — Fü 
! S ^ PIS 4 


where we have used the form of the norming constant of a Ga(a, « + t)-density. 


c) Using the formula for the moments of the Pa(a, x) distribution we get that E(r) = &/(a — 1) 
and 


eo ae sre - 2k? k? " Ka 

MONE) EME) Sa aay a aaa) 

On the other hand we have E(7) = 1/A = «/(a—1) = E(r) and var(7) = 1/3? = &?/(a.—1)?. 
Hence var(7) > var(7) if and only if a/(a — 2) > 1 which obviously holds for a > 2. 


Solution 10.12 (The Gompertz survival model) 


a) Since the density of F is given by f(t) = exp(—5(e ges Yee — F(t)ae", the hazard 
function of the Gompertz distribution is y(t) = f(t)/F(t) = ae and the cumulative hazard 
function is P(t) = fp (s id = $(e% — 1). 

b) If yem(t) = vya(t) + c = ae" + c, the corresponding survival function Fay is 


Faw(t) = exp (- nc t c) ds) = exp(— [gers + es]. ) = exp(-$ (e^ qe ct) 


and hence the distribution function Fem is Faw(t) = 1 — exp(-£(e* — 1) — ct). 


Solution 10.13 (Reduced-form credit spread under different recovery assumptions) 

a) Under RT the payoff of the bond at the maturity date T is Ir; py + (1 — ó)Ig zm), where 
the first summand is the survival claim payoff and the second summand comprises the 
payoff of the recovery. This simplifies to ôIt,>r} + (1 — ô) and hence the price of the bond 
is (recall that 7 > t by assumption) 


pi (t, T) = E? (polt, T) (615... 7, + (1 — 6)) | He) = (1 — é)po(t, T) + SE? (po(t, TE my | He). 


The last expectation is the value of a survival claim at time t and we obtain 


pi(t, T) = (1 — ô)po(t, T) + ópo(t, TJIesy exp(- [ r (sds). 


The spread is thus 


een prn [oe 
et Ty = Tot 8T) T. — ô + dexp 49 (s) ds 


Using exp(—x) ~ 1 — a and then log(1 — x) ~ —z for x small, we have for T — t+ that 


e(t, T) = — z= log(1 — NT — 0) e 6 (0. 
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10 Solutions to Credit Risk 


b) 


Under the RF recovery model the value of the survival claim associated with the bond is 
given by 


T 
polt, T) expl f 7%(s) ds). 


To value the recovery payment we consider 


EQ (T po(t, T)) 
Os boss 24) — (1 — 61g, {t<r<T} , 
((1 — 8)po(t, 7) Ltt<cr<r} | Ht) = (1 — 9) exp(— fy 72(s) ds) 


—-(1—-64p5sg f st s)y9(s) exp(— ri 9 (u) du) ds. 


Now note that po(t, s)/po(t, T) = 1/po(s, T). Therefore, on (7 > t}, c(t, T) equals 


Lene a) f en [2] 


For T — t+ we get as in the case of the RT model that c(t, T) ~ dy2(t). This is easily 


derived from the above equation, using that lim. 44 c(t, T) = =a T) 


Ir-c 


Ex. p. 80 Solution 10.14 (Pricing and calibration of a CDS) 


a) 


Since we consider the valuation date t = 0 all conditional expectations with respect to Hi 
become ordinary expectations. Note that Q(rg > i/4) = e-G/7. Substitution in the 
valuation formula for the premium leg of the CDS, see MFE (2015, equation (10.39)), gives 


20 
rem = —(& —(& a T Q) 
v? (x;72)) — ye Gv e /4 e G/4)(r-y*) 


Similarly, we get from for the valuation formula for the default-payment leg, see MFE (2015, 
equation (10.40)), that 


a enm, 
0 


AQ 
Vets EY T2 —(r4-39)s = E —(r4-39)s 
23 [o zQ ds sse 


'The value of the cash-flow for an annualized fixed spread x from the viewpoint of the 
protection seller is finally given by WP ^" (x; 59) — Wdet(59), 

For the approximate formula, we have 72 ~ z*/ó = (42/0.6) bp = 70 bp. Since r and 59 
are constant, exact calibration of the hazard rate is possible based on the equation 


s 0.25 -Q 
f e-Q/0(452) — | e-+) gs = FOF (1 — e-0/90+9) 
rs 06] 7%e ds r4 500 e ) 


for 72 (see MFE (2015, equation (10.43))); substituting z* = 0.0042 and r = 0.02, numerical 
root finding leads to 72 = 0.00698 = 69.8 bp. 
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Basic 


While the calibrated value for 72 can be used for the pricing of other credit products on 
R, it should not be used for VaR calculations, since 7° is (an estimate of) a risk-neutral 
hazard rate whereas for VaR calculations an estimate of the hazard rate under the historical 
measure is needed. 


Solution 10.15 (Application of the Feynman-Kac formula) 


a) 


Denote by (V) the solution of the stochastic differential equation (SDE) d, = u dt +o dW;. 
This is simply an arithmetic Brownian motion, and the solution is given by V, = Yo + ut + 
oW;. Denote by (F+) the filtration generated by (Y+). Using the Feynman-Kac formula, 
see MFE (2015, Lemma 10.21) (with h() = 0, R(v) = 0 and g(v) = v?), it holds that 
f (t, Vz) = E(V2.| F,). By the Markov property of (V,), we obtain that 


f(t, v) 2 E(V$. ,|Vo = v) = E((v + n(T — t) + oWr-i)?) 
— y+ (T — t)? + E(oWz 4)) -29g(T — t) + 2( + uT — t)E(oWr. 1) 
=? +p’ (T — t) + 0° (T — t) -29u(T — t), (S1) 


where we have used that E(Wr—+) = 0, E(W2 ,) =T — t. Clearly, f(T, y) = v?, and 
elementary calculations show that (S1) does in fact solve fi(t, v») J- p fu (t, b) 4- lo? Just; uy = 
0. 

In this case the SDE associated with the PDE is dV, = pW; dt--o Y, dW. This is a geometric 
Brownian motion and the solution is given by V, = Wo exp(oW; + (u — o?/2)t). Using 
the Feynman-Kac formula, see MFE (2015, Lemma 10.21) (with h(w) = 0, R(v) = r and 
g(w) = log w), it holds that f(t, Y+) = E(e- 7? log(Wr) | F). By the Markov property of 
(V4), we obtain that 


F(E, W) = E(e 7? log(Wr 4) | Yo = y) 
e "(T9 (logy + E(oWr— + (u — o?/2)(T — t))) 
= enr (log yb + (u — 0?/2)(T — t). (82) 


Clearly, f(T, Y) = logy, and elementary calculations show that (S2) does in fact solve 


Solution 10.16 (Vasicek model) 


We apply MFE (2015, Proposition 10.23). We have o? = 0 and pt = 1; u(y) = a(b — v) = 


k? + ka for k? = ab and kt = —a; o? (sp) = o? = h? + hiy > 0 for h? = o? and h! = 0. MFE 


(2015, Assumption 10.22) is satisfied and Proposition 10.23 implies the following ODE system, 


see MFE (2015, equations (10.66), (10.67)): 


BGT) =1+ aB(t,T), p(T, T) =0, 
o/(t, T) = 0 — abB(t, T) — 0° 8° (t, T), o(T,T) =0. 
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Note. This system is easily solved as the ODE for £ is linear. We obtain 6(t,T) = —(1 — 
p 9 /a; the expression for a is known explicitly. However, it is quite long and therefore 
omitted. 


Solution 10.17 (Simplified CDS pricing) 


a) 


We get from risk-neutral pricing that the price of the modified default payment leg equals 


N F 
ò y» E^ (e fo e en emi) f 
i=1 


Since Jr, cen) = Iit) — Ipgnuy the result follows. With positive interest rates 
the value of the modified default payment leg is lower than the standard default payment 
leg, since the default payment is received at the first premium payment date after 7g (so 
later). 


Note that with a deterministic short rate, 
Q if? Ts ds 
EQ (e7 Jo "EO sus Ieegsi)) = PCO, ti) (QCR > ti-1) - QCR > ti)). 


Rearranging the terms of the sum gives the result as stated. 


We first consider Q(TR > t). We know from MFE (2015, (10.53)) (applied with 7 := Tp, 
t:=0, T := t, rs := r, Ys := Vs, Fe = OV: s < t}) and X = 1 almost surely) that 


t 
E? (e rt d V, ds 


EL( 


e" Ig | Go) = Ing so) Fo) 


t 
and thus that Q(rg > t) = E (Itrp>1}) = E9(e So Yds). For a CIR process, we obtain 
from MFE (2015, p. 419) that the latter expectation equals exp(a(t) + 8(t)Wo), where 
the functions a, B are given in MFE (2015, (10.70), (10.69)); note that in these two cited 
equations, p° :— 0, p! :— 1 and 7 := t. We can thus evaluate Q(rg > t). 
With po(t, T) = exp(—r(T — t)) for r = 0.01, we obtain from (E2) (applied with t; = $T, 
i — l,..., N, and N = 20) that the default leg is 
N-1 


Vj = 0.6 (moto t2) — p(0, tw)Q(n > tw) + X- Q(tR > ti)(po(0, ti+1) - (0.5) 
gel 
N-1 
= 0.6 Gs —e"TQ(tr > tw) + XC Q(ra > t)(e 7 — a) = 0.099945. 


i=1 


The premium leg for a unit spread is given by MFE (2015, (10.39)) (with t :— 0, x := 1) as 
N 
WP? = M po(0, ti) (ti — ti-1)Q(TR > ti) = 4.436493. 
i=1 


The fair spread is thus z* = Vet /WP"*"* zz 2.2528%. 

Now consider the remaining part of the question. Clearly, increasing Vo or 0 will increase 
the spread as it makes default more likely. Increasing c typically lowers the spread, for 
instance, for o = 0.25 we obtain that x* = 2.236696; for o = 0.15 we have x* = 2.265796. 
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Solution 10.18 (Matrix exponential for continuous-time Markov chain) 
The matrix of transition probabilities is given by 


P(t) = exp(At) -5 rat 


and the generator satisfies A^ = (ADA~!)* = AD*A-!. Hence, 


Advanced 


© qk ee oo qk ; m oo qk : i S 
E n. -A(X gr) -A(X gted dA 


from which the formula follows. 


Solution 10.19 (Calibrating hazard rate model when CDS spread curve is flat) 


a) We show that the right-hand side of the claim equals the left-hand side. To this end, the 


change of variables s = t — (k — 1)At leads to 


oyt N pkAt ES 
572 [ve OM dt ss | e CH dt 
ci Jt=(k—1)At 
NOR 5 6 
-eusj e (HAA (k-DAt-(r+7%)5 ds 
k=1 ” 550 
N 
= Y e EMAL Sy / -59)s ds 
k=1 


N 
= x At 5 eT CHICKA NEN. 
k=1 


b) Under the given assumptions, we obtain from a) that the same value of 72 can make the 
value of the premium and default legs equal for a CDS with any maturity, that is any value 


of NEN. 


Solution 10.20 (Risk management for a corporate bond) 


The crucial point of this exercise is to distinguish carefully between risk-neutral quantities and 


quantities that have to be simulated using the historical measure P. 


a) The price of the bond at time t is given by 


2 140 
Itzi) EQ (e Ji r+ ds 


Fi) = (1 a Yi)p(1, 2, 1; r, 2, «2, 02,0). 
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Ex. p. 82 Sol 
a) 


b) 
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Hence, to generate a realisation of the bond price at t = 1, we need to generate a realization 
of 1 — Yi and of the risk-neutral default intensity 4. using in both cases P-dynamics. This 
leads to the following algorithm for generating one realization of the loss associated with 
the bond: 


1) To simulate a realized trajectory of the default indicator Y = [y,<;3,0 < t < T, we use 
threshold simulation as follows; see also MFE (2015, Algorithm 10.13). 


i) Generate a trajectory of w using the P-parameters «P, 0*, o up to T = 1 and compute 
the integrated process IT — Jh V.ds,t € T. 
ii) Generate E ~ Exp(1) independent of the simulated trajectory of y. If TẸ < E put 
Y: = 0 for 0 € t € T. Otherwise, let 7 :— inf{t > 0: p? > E) and put Y; = 0 for 
t<7rand Y,;=1 fo t>. 
2) Return the loss 


Po — Iti, 2, V1; T, 2, Ke, 92, o). 


Repeating this algorithm many times leads to an approximation of the loss distribution 
based on which VaR can be computed by empirical quantile estimation. 


To price an option via Monte Carlo the algorithm from a) needs to be modified in order 
to account for the fact that now we need to generate a trajectory of the default indicator 
assuming that r has the hazard rate process y and we need to work with the Q-parameters 
in the hazard rate dynamics. T'his leads to the following algorithm: 


1) Generate a trajectory of i) using the Q-parameters «9, 09, & up to T = 1 and then a 
trajectory of 49 = 2y. 
2) Generate for the trajectory of ^? a realization of 1 — Y; ‘under Q’ via threshold simulation. 


3) The payoff of the option for this realization of y and Y is 
Do — (1 HE Yı)p(1, 2, VA; T, 2, KÊ, 92. c). 


Repeating this algorithm many times and taking the average of the option payoffs generated 
in this way gives an approximation for the price. 


ution 10.21 (Credit value adjustment formula) 


Denote by E(V,*) the expected positive part of the market value in t; note that B suffers a 
loss only if the market value of the contract is positive for him. Hence, the formula gives 
the expected loss for B if S defaults before maturity. 


In case that S is a big reinsurance company one would expect that E(V,* | ra = t) > E(V;*) 
since a default of S could be caused by a catastrophic event that leads to an increase of the 
premia for reinsurance contracts. Moreover, the event that caused the default of S might be 
covered by the reinsurance contract between S and B so that B does not receive the claim 
payment (s)he is entitled to. 


Advanced 


c) We first compute E(V,*). Let a = ot and V;  N(0,02). Using the first part of the hint, 
we obtain that 


E(V;^) = —_ T ze 9a, — 1 [-ater ery) = ae 
0 


and thus E(V;*^) = ot/(27). Substituting in the CVA formula and using the second part of 
the hint, it follows that 


T 
CyAindep — sss f te OS ts t1)t dt 
2m 0 


= ôs TYS ( ! e- Qs r)T (r 4 l ) ) ; 
2n(4s + yB +r) \ys +y +r ys +w +r 
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Review 


Solution 11.1 (Dependence in credit risk management) 

a) The economic health of different obligors is affected by common (macroeconomic) factors, 
and there may be direct effects due to contagion and counterparty risk. 

b) The mode of the loss distribution moves to the left of the mean, and there is substantially 
more mass in the right tail than in the case of independent defaults; see MFE (2015, 
Figure 11.1). It is therefore important to do robustness checks with respect to the parameters 
governing default dependence, both in risk capital computations or in the valuation of 
securitization products such as CDOs. 


Solution 11.2 (Gaussian threshold models) 


a) In a Gaussian threshold model one has that 


Xi = V/BiFi + VA — Bei, 


where F, and €1,...,€m are independent standard normal variables, and where 0 € 6; € 1 
for all i. Moreover, for all i F; is a linear combination of a fixed p-dimensional factor vector 
F, which is identical for all obligors. Factor models are relatively parsimonious, as the 
number of parameters grows only linearly with the portfolio size m, and the factors can 
be interpreted as country and industry effects. Details on Gaussian threshold models with 
factor structure are given in MFE (2015, Section 11.1.3). 

b) The asset correlation of obligors 7 and j is the correlation between the critical variables 
Xj; and X;. In a Gaussian threshold model the latter is given by given by p(X;, Xj) = 
Bib; cov(F;, Fj). The default correlation is the correlation p(Y;, Y;) of the corresponding 
default indicators Y; and Y;. 

c) The threshold d; needs to satisfy P(X; < d;) = pi; since X; is standard normal, this gives 
d; — $-1(d;). 

d) See MFE (2015, Example 11.11). 


Solution 11.3 (Tail dependence in threshold models) 


The probability that firms 21,..., 2; default is equal to the probability that the critical variables 
Xj,..., Xi, are simultaneously below the thresholds d;,,...,d;,. Typically, default probabilities 
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are small so that the d;, correspond to a low quantile of the distribution of the X;,. Hence lower 
tail dependence makes the occurrence of large default clusters more likely. For a numerical 
illustration see MFE (2015, Section 11.1.5). 


Solution 11.4 (Exchangeable Bernoulli mixture models) 
a) It holds that 7 = E(Q) and py = mame where m2 = E(Q?). Moreover cov(Y;, Y;) = 
var(Q) > 0, see MFE (2015, Section 11.2.2) for details. 


b) Denote by M(™ = Y7?* , Y; the number of defaults in a portfolio with m obligors. According 
to asymptotic results for large portfolios such as MFE (2015, Proposition 11.18) it holds for 


large m that go (7) X qo (Q), that is VaRa (Mf) z m VaRq(Q). 


m 


Solution 11.5 (Importance sampling in credit risk models) 
a) See MFE (2015, Section 11.4.1), in particular Algorithm 11.22 and the subsequent discussion. 


b) In large portfolios the variance of the portfolio loss stems mostly from fluctuations in the 
factor V and less so from the variance of the conditional loss distribution. 


Basic 


Solution 11.6 (Calculating moments of portfolio credit loss distributions) 

Note that the default indicator of obligor i € {1,...,m} is Y; ~ B(1,p;) (in particular, 
E(Y;) = E(Y2) = p; and var(Y;) = pi(1 — p;)) and that, throughout this exercise, p; = p = 0.01 
for all i € {1,...,m} and m = 1000. 


a) We need to compute the first two moments of L = 77:4 ó;Y;. Since 6; = 6 = 0.4 for all 
i € (1,..., m}, we have that E(L) = móp = 4 and var(L) = mó?p(1 — p), so sd(L) ~ 1.2586. 

b) Here we also need to compute the first two moments of L = 7", 6;Y; and we have 
6; = ô = 0.4 for all i € (1,..., m]. As before, we thus have E(L) = móp = 4, but now 


var(L) — 8? wx( 0%) = P(Y var) +2 5 cov (Y Y;)) 


1<i<j<m 


= 8? (mp —p)+2 (2) cov(Y1, Ya) ) = ó^(mp(1 — p) + m(m — 1) cov(Y1, Y2)). 


Since cov(Y1, Y) = corr(Y;, Yj) /var(Y1) var(Y2) = pp(1 — p), we obtain 
var(L) = 8 (mp(1 — p) + m(m — 1)pp(1 — p)) = Èmp(1 — p)(1 + (m — 1)p), 


so sd(L) ~ 3.0816. 
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In this case we need to compute the first two moments of L = 377, A; Y;, where E(A;) = 
a/(a + b) = 0.4 = ô for all i € {1,...,m}. Therefore E(L) = mE(A1)p = 4 as before, but 


var(L) = ps var(A;Y;) + 2 5 cov(A;Y; A; Y;) 


i=l 1l<i<jsm 


= m( E((A1Y1)?) — E(A1Y1)?) +2 D ( E(A1Y1A2Y3) = E(A1Y1) E(A2Y2)). 


Because of the independence assumptions, 


var(L) = m(B(A3)E(¥2) — E(A1)PE(Y:)?) 
+m(m — 1)E(A1)E(A2)(E(Y1Y2) — E(Y1)E(¥2)) 

= m/( E(A?) E(Y1) — E(A4)? E(Y1)°) +m(m— 1) E(A,)? cov(Yi, Y2) 

= mpE(A1) — mp*E(A1)* + m(m — 1)E(A1)^ pp(1 — p) 

= mpvar(A1) + mpE(Ai1)? — mp?E(A1)? + m(m — 1)E(A1)?pp(1 — p) 

= mpvar(Ai) + mp(1 — pJE(A1)? + m(m — 1)E(Ai1)?pp(1 — p) 

= mp var(A1) + mp(1 — p)E(A1)*(1 + (m = 1)p). 


With E(A;) = z% and E(A1) = ee we obtain var(A1) = GRO which 


leads to sd(L) ~ 3.0978. 


Note. Note how adding default dependence has a profound effect but adding independent 
stochastic LGDs has a rather minor effect; relaxing the independence assumptions for LGDs 


would lead to a greater effect. 


Solution 11.7 (Correlation bounds for default indicators) 
a) For j € {1,2}, let X; ~ B(1,p;), pj € (0, 1), with distribution function 


0, c «0, 
F;(x) = 1 = pj, x € [0,1), 
T; qoe 
and corresponding quantile function 
—oo, u-—0, 
Fj (u) = 0, uc (0, 1 — pj], 
1, ul — Pj. 


We know that p € [Pmin, Pmax| and p = Pmin if X1, X2 are countermonotone and p = pmax if 
X, X2 are comonotone. Consider the former. Then X; = F$ (U), X2 = Ff (1 — U) for 
U ~ U(0,1). Note that, almost surely, X1 = Iyysj—p,} and X2 = Ir us4- p, Therefore, 
E(X1X2) = E(IHtuz1-p M -Us1-p;) = EUgusi_p,,1-us1—p}) = P(U > 1- pı, 1-U > 


265 


Ex. p. 86 


11 Solutions to Portfolio Credit Risk Management 


1— p2) = P(1— p; < U < p2) = max(pi + po — 1, 0} = W(pi, pa). Since X; ~ B(1,pj), 
j € (1,2), we also know that EX; = p; and var X; = p;(1 — pj), j € {1,2}, and thus that 


auus EUEG)- PRO... Wpupo ie 
Vpi(1 — p1)p2(1 — p2) | ym(1-— pi)p2(1 — p2) 


Similarly, for comonotone X4, X2, which almost surely allow for the representation X, = 
Itys1-g,) and X2 = Ity51- pj for U ~ U(0, 1) and for which E(X1 X2) = P(U > 17251, U > 
1—p3) = P(U > max(1—pi, 1—po}) = 1—max(1—pi, 1—po} = min(pi, po} = M(pi, p2). 

Figure $.11.1 shows pmin and pmax, respectively, as functions of p1, po. The lower correla- 
tion bound pmin is especially large if p? and po are both small or both large simultaneously, 
whereas the upper correlation bound pmax is especially small the further apart p; and po 


are. 


Pmin(P1, P2) 
Pmax(P1, P2) 


RNS Se di 
WER ROKK? 
RAN 


Figure S.11.1 Minimal correlation pmin (left) and maximal correlation pmax (right) as functions 
of pı, p2 for B(1, pj) margins, j € {1,2}. 


b) If p= pı = po, it follows from a) that 


2 
max(2p —1, 0} —p =(1 1/p)!texi/31-1t»1/2 
p(1 — p) 


Pmin — 


and pmax = 1. Since pmin = —1 if and only if p = 1/2, the range of p is maximal precisely 


for p = 1/2 in which case it is [-1, 1]. 
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Solution 11.8 (Asset correlations in Gaussian threshold models with two-group structure) 
Note that E(X;) = 0 and var(X;) = (VB)? + (VI = Bi)? = 1 for all i € {1,...,m}. Therefore, 
the asset correlation corr(X;, X;) equals E(X;X;) for all i, j € {1,...,m}. Since 


E(X;X5) = E((VBiFi + VI = Bie) (/ B5 FA + V1 — Byes) 
= /BiBjE(F2) -0--0--0— BB, ijsm ij, 


and 


E(X;X;) = E((/Bi(pPi + V1 — p? F3) + V1— Biei)(/ Bj(pFi + V1 — p? Fo) + /1— Bye;)) 
= /BiB;E((oF1 + V/1— p? F3)?) +04+040 = VAB (EloF?) + E((/1— p? F2)?)) 
= BiB;(p? + (A — PP) = V BiB; i,j >n, ij, 


and 


E(X;X5) = E((VGiFi + VI = Biei) (yBy(oFa + V1 — pF») + /1— Bye;)) 
v BiBjE(Fy(pFy + 1 — p? F2)) +0+0+0= /B;iB;(gE(F?) + V1 — p?E(FAF3)) 

eu B AEn 

we obtain that the within-group asset correlation is J/8;8; (for i, j both in the first or both 


in the second group) and the between-group asset correlation is J/B;f;p (for i, j in different 
groups). 


Solution 11.9 (Joint default probabilities in Gaussian threshold models) 
a) By conditioning on F, the probability of obligors (41,..., ig} C (1,..., m] to default is 


PGES dis- -3 Xip < du) = P(V ba F + V1- Pat duco F dx Bien) 


-afire ego) 
-FEE o dz. (S1) 
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b) In the exchangeable case, we can use the simplified notation 7; for the joint default 
probability and obtain from (S1) (with d; = d and 8; = p = corr( Xy, Xi) for all i, k,l) that 


n= [oo (GB anne [7 (PRISE ane 
where we used that, for each i, P(X; < d) = ®(d) = s. 


Solution 11.10 (Joint default probabilities in threshold models with Gumbel copula) 
Suppose that X = (X1,..., Xm) is distributed according to a joint distribution with Gumbel 
copula and equal marginal distribution functions F. Consider the threshold model (X, d), 
where d = (d,...,d) for d € (0,1). Clearly 7 = P(X; € d) = F(d). Furthermore, any 
k-dimensional marginal copula of an m-dimensional Gumbel copula is a k-dimensional Gumbel 
copula with the same generator, say v(t) = exp(—t!/?) for some 0 > 1. This implies that 


TE = PLA m duos 


= P(F(X1) < m,... 


Xi, < d) = P(X; < d,..., Xp d) = P(F (X1) < F(d),... 
,F(Xk) < T) = CU e serA) ek ylk! (r) = PATHE E 


,F(Xk) € F(d)) 
qr 


Solution 11.11 (Exchangeable one-factor Bernoulli mixture models) 

a) For k = 0, m = P(Y1 —0,..., Ym = 0) = E(P(Y; = 0,..., Ym 20|Q)) = ET] PY; = 
0|Q)) = E(TA4 (1 — Q)) = E((1 — Q)"), where we used that the Y;'s are conditionally 
independent given Q and that (Y; |Q) ~ B(1,Q). For k € {1,...,m} and 1 < jj <- < 
jk « m, using the same two properties, we obtain that 


ll 
n 
Fd 
~~ 
— 
eS 
Il 
n 
RS. 
+ 
o 
Fd 
A 
mL 
Ras 
Il 
o 
Vc 
Il 
ga. 
=> 
Eos 
Sale 
| 
AA 
-n 
—z 
oS 


py = corr(Y;, , Y;, ) ES E(Y5 Y) = E (Yj) E(Y;,) - ER) Si E(Y5)? 
VEC?) -EQ))EQ2) -E(YS?) — EO) - EQ 
m — n2 
|m-7l 


where we used that E(Y2) = E(Yj,) = m. 
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Solution 11.12 (Exchangeable beta mixture model) 


Ex. p. 88 
a) By Exercise 11.11 a), 


= T C jue bes 
m - E(a - 9" = [a - gE da 


| B(a,b+m) ie g* 1(1-—.9)9tm-1 . DB(a,b-c- m) — T(a)L (b -- m) (a+b) 
E o Blabt+m ^ Blab) I(acb-m)I(a)T (b) 
T(b+m)  L(a- b) (b 4- m — 1) --- (b 4- 1)0T (b) 


T(b) I(ac-b-m) T(b) 
T'(a +b) _ _ Ing (641) 
(a-- bm — 1) --(a- b -- 1)(a- b)TF(a b) — TT?! (a 4- 6 4- 1) 
-T bl 
pg eT 


Similarly, for k € {1,...,m} and 1 < ji <- < jk < m, 


Ty = P(Y; 21,..., Yj, = 1) = E(Q*) 


[= q)?! B B(a 4- k, b) T gere _ qi H B(a ES k, b) 
o? Bab) 4  B(ab Jo  B(atkb 4 Bab) 


. P(a-c k)T(b) F(a-- b) T(a+k) T(at+b) _ (a+k—1)--- (a+ 1)al'(a) 
= T(a--k-b)T(a)T(b Tia) T(at+b+k) © T(a) 
T'(a +b) —— Tbe @+) 
(a4-b-- k— 1): (a4-b-- 1)(a-- b)E(ao- b) — [IE (a 4- b 4- 1) 
E uds] 
zb. sm 


l=0 


b) By a), mı = E(Q) = a/(a + b) and m2 = E(Q?) = a(a + 1)/((a + b)(a +b + 1)). By 
Exercise 11.11 b), 


m-or wth (zm) — i -.  CiDGr) 
py — 2 = 2 = a = 
ce —— 1-3 
1a? -- a 4- ab -- b — a(a 4- b 4-1) 1 
m" atb+1 ~ abl 


From a/(a + b) = 71, we obtain that a+ b = a/rı. Since 1/(a+ b+ 1) = py, we have that 
1/(a/m + 1) = py from which it follows that a = v1(1/py — 1) and thus b = a(1/m — 1). 
With 7, = 1/100 and py = 1/200, we obtain that a = 1.99 and b = 197.01. 
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c) Let M =)", Yj. Conditioning on Q and using that (M |Q) ~ B(m, Q), we obtain that 


p, = P(M = k) = E(P(M =k|Q)) =E | (t) Qa - o^) 


dq 


| [m Batk,b+m— T géth-1( — g)tm-h-1 
i B(a, b) o Bla+k,b+m-—k) 


| [m B(a  k, b m — k) = m 
x(n) SBE mD, setom 


n S (Gea) 
= (P) (ETA, eeto- 


e) Figure S.11.2 shows the probability mass functions (pp)f% o and (pi)09 9 (left) and the 
quotient (py./pi.)t9.g (right). We clearly see the difference induced by the small correlation 
py and, especially from the graph on the right-hand side, the effect py has on the distribution 
of the number M of defaults; in particular, under the model with small but positive py, a 


large number of defaults is much more likely to happen. 


Ex.p.88 Solution 11.13 (Exchangeable probit-normal mixture model) 


a) The distribution function is 


-l(g). $71(z)-n 
foto) =o 8) E ian ree se. 


c o dx o¢(®-!(x)) 
b) In Exercise 11.9 we constructed an exchangeable one-factor Gaussian threshold model with 
eo $^ (m) + pz 
= Pe) ou fry EP, 
m-f oad = f L2 Jae) 
= [s e 02)0(2) dz, 
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a => Probability under beta-binomial Probability under beta-binomial 
= 4 — Probability under independence = I 
[z Probability under independence 
o 
Y 4 - Reference line y = 1 
o 2 
mo 
co x 
oH ec 2 
o o 
o 
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o 
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xt 
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Figure S.11.2 Probability mass functions (p,)e29 (under Q being beta distributed (parameters 


chosen to match 7, = 0.01, py = 0.005), so M following a beta-binomial model) 
and (jx) p29 (under independence, so M following a B(d, m1) model (so py = 0)) 
(both on the left) and the quotient (p;./i.)09. (right). 


where we used 


and o= — (S1) 


dabo p 
Obviously, 7; equals E(®(X)*) for X ~ N(,07) and thus the exchangeable probit-normal 
mixture model is equivalent to an exchangeable Gaussian threshold model. Furthermore, 
solving the second part of (S1) with respect to p, we obtain p — si Together with the 
first part of (S1), we thus obtain 7 = $(u/1— p) = $(u/V1 + o?). 
c) Since in Gaussian threshold models mı = 7, we have E(Q) = 7, = 7 = ®(u/V1+07). 


Solution 11.14 (Exchangeable Bernoulli mixtures with extremal mixing) 
a) The minimal correlation po = 0 is attained if Q is deterministic and equal to 7, that is for 
0 forO<q<m7 
Gmin(g) = 
1 frr<q<l; 
in that case the Y; are in fact independent. The maximal correlation p¥** = 1 is attained if 


Q = 0 with probability 1 — 7 and Q = 1 with probability 7, that is for 


l-r for0d<q<l 
1 forq—1 


G(g) = i 
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b) For the mixing distribution G™**(q), all obligors default if and only if Q = 1. We can 
therefore write all default indicators as increasing function of a given random variable. For 
instance one may take a standard uniform random variable U and let Y; = It554. 4, (this 
is not the only construction, of course). 


c) This follows as expected shortfall is subadditive and comonotone additive. In general 
G™?*(q) does not maximize VaR; a counterexample for m = 2 is provided in Exercise 2.20. 


Solution 11.15 (Two-factor CreditRisk* model) 

In CreditRisk*, we have Y;|( = y) ~ Poi(k;w'1»); see MFE (2015, Section 11.2.5). By 
comparison with the information given, we have k; — 0.01, à — 1,...,1000. Furthermore, 
wj = (wi, Wi2) with w; = 1 for à = 1,...,500 and wj, = 0.5 for à = 501,...,1000, as well as 
w= 0 for i = 1,...,500 and wy = 0.5 for i = 501,...,1000. We also know that the loss 
pertinent to obligor iis Li = e;Y;, where e; = 1. The conditional distribution of the aggregate 
loss L = XT Li given the factors W satisfies 


x 500 1000 
L| (T -)- ~ Poi( X50. Oly + M7 0.01(0.5u +0. 5u2)) = Poi(7.5y1 + 2.53). 
1—501 


In view of the convolution property of the Poisson distribution and the independence of 
the two factors V4 and Yə we see that L can be decomposed into two independent pieces 
L = LO + LO where O) is associated with the factor V, only and L2 with the factor V» 
only. 

We now show that LU), j = 1,2, is negative binomial distributed. To this cud; let Ay = 7.5, 
Ag = 2.5 and j € (1, " Then LY) | (V; = Wj) ~ Poi(A;v;) and AV; ~ Ga(3, z Dy ). It follows 
that LY) ~ NB(3, 155). 
distribution we obtain that g(L0)) = = Aj and var(L)) = A;(1 + 24), and thus we obtain 
E(L L) = = A1 + A2 = 10 and var(L) = A1(1 + 2A1) + à2(1 + 242) = 135. 

Note. We used the following distributional properties in this solution: 

a) If X; ^? Poi(à;), i=1,...,n, then 7%, X; ~ Poi(Y 7 4 A;). 

b) If X ~ Ga(a, B), then cX ~ Ga(a, cB) for all c > 0. 

c) If Y ~ Ga(a, 8) and X |Y ~ Poi(Y), then X ~ NB(a, 8/(1 + 8)). 

d) If X ~ NB(r, p), then E(X) = r(1— p)/p and var(X) = r(1— p)/p? = E(X)/p. 


Using formulas for the mean and variance of the negative binomial 


Solution 11.16 (Large portfolio asymptotics application) 
a) In the one-factor Gaussian threshold model with F = —W (see MFE (2015, Section 11.2.4)) 
we have 
Xi = V B;F + V1— Bie; = —vV/BiV + V1— Bi£i, 
where W,¢1,...,€m ^» N(0,1). The variance of the systematic part is var(—/;W) = 
and the tonditionsl default probability p; given V = yw can be shown to be 
TE Bi» 
i MEN uh 


272 


Basic 


where p; is the unconditional default probability; see MFE (2015, Section 11.2.4) for the 
derivation. Note that p;(w) is an increasing function in v. 

The loss coming from obligor i is L; = 0.6e;Y;, where e; is the exposure of obligor i and 
Y; its default indicator. Therefore, 


liy) = E(L;|V = v) = 0- P(Y; = 0| V = v) + 0.6e;P(Y; = 1| V = v) = 0.6eipi(V) 
$ (pi) + d 
vl- Bi 


= 0.6e,0( 


The total exposure in million when there are m obligors is am = (m/2) - 5 4- (m/2) -1 = 3m. 


This leads to the asymptotic relative loss function I(q) given by 


: 1 
ea, a 
_ l/m ,, - 5/9 (0.01) + V08PY m ,, , gf 8 (0.01) + V0.8) 
-x 0.6-5-0/ 309 )*3 o6.1.9( mz ) 
Moaea $ (0001) V020N m. -1(0.001) + /0.2: 
+7 06:5 o( UR ) de 0.6-1-0/ Tz )) 


where the four terms correspond to the risky loans for the amount of 5M, the risk loans for 
the amount of 1M, the safe loans for the amount of 5M and the safe loans for the amount 
of 1M, respectively. 

Sorting the terms according to risk and safe loans, respectively, we obtain 


$-1(0.001) + E 


+ 0.9 - a( 
V0.8 


By MFE (2015, Proposition 11.18) and a), the 99.9% value-at-risk of (19000) — 5510000 p, 
is 


VaRo.g99( L000) 7X 010 o00l(qo.999(¥)) = 30 000 I(®71(0.999)) ~ 7777.6597 


which compares with a total exposure of 10000 - 0.5 - 5 + 10000 - 0.5 - 1 = 30000 (all figures 
in million); note that (with a slight abuse of notation) qo.999(V) denotes the 0.999-quantile 
of the random variable V ~ N(0, 1). 
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ution 11.17 (Large portfolio asymptotics with stochastic LGD) 
We have 
pi(9) = P(Y; 1| V = v) = P(X; < di | F = —4) = P(V BF + v1 — Bie; < di | F = —4) 
d; — VBiF | ) ( di + VBiv 
= P| z; < F = =y | = P| e; < — 
( v1- Bi 


SIR.) = a(i + o) 
for u; = di/ V1 — 8; = #71 (p;)/ VI — Bi and e; = V/B;/ VI — bi, i € (1,..., m). Conditional 


independence of Y; and Y; given V follows from independence of e; and e;. 
Imagine increasing the portfolio while retaining its structure. In a portfolio of size m we 
have L™ = 377 | Li, where 


L(y) = E(L; | V = v) = 0- P(Y; = 0| V = v) 4-0.6e;P(Y; = 1| V = v) = O.6eipi(v) 


$ (p) v Pi ; 
= 0.66,0( + v), i c€11,...,m]. 
VU pio Adm t ) 
For half of the exposures (in M GBP) we have e; = 2, p; = 0.01 and b; = VB; = 0.6 giving 
$-1(0.01) 0.6 ) = 3 ) 
= v —036 vi- 036" 0.8 4" 


and for the other half of the exposures we have e; = 4, p; = 0.05 and b; = VB; = 0.8 giving 
$-!(0. 0.8 $-1(0.05 
(0.05) e) =2.40( (0.05) 
V1-—0.64 /1—0.64 0.6 
Since am = (m/2) - 2 + (m/2) - 4 = 3m, the asymptotic relative loss function is 


li (v^) + lio(v^) 
SEEN MEE 


(9) = lalt) = 06-4 +30). 


i) = lim. — (9) = z- (tut) 520) = 


i=1 


By MFE (2015, Proposition 11.18), the 99% VaR can then be approximated by 
VaRo99(L0)) ~ azo o00l(go.99(¥)) = 30 000 1(-1(0.99)) ~ 8422.8740. 


In this case we have the new i, given by 


(4) = E(L;|W = V) = 0 - P(Y; = 0| Y = y) + eE(Ai| V = 9)P(Y; = 1| V = v) 


o! i i 
= eiE(A;| V = v)pi(V) = e:8(0.5 + oe( = T Ace) 
QE(AIV — V). s py S9 3 V), 
= Hg 06eipi(V) B — 96 “W 
and thus the corresponding asymptotic relative loss function 
z . (05-4 V). 
Hur c tC) 


in terms of the asymptotic relative loss function l(Y) from b). Therefore, the 99% VaR, is 
scaled by 6(0.5 + ®—!(0.99)) /0.6 ~ 1.6627 leading to approximately 21022.4655. 
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Solution 11.18 (Importance sampling with exponential tilting for gamma) Ex. p. 90 
a) The moment-generating function of X ~ Ga(a, 8) is 


a 


Mx(t) = i exp(tz) fx (x) dx = rix exp(tz) B z^ lexp(—fx) dz 


Tr(a) 


B pe oo Ae 
^il z^ lexp(-(B — t)z) dx 


y-(8-t)o ro L3) i SW) 3 dy 
B B^ T(a) = ( B y 
Tr(a) (8-H \B-t 


which is defined for all t < 8. The importance sampling density g; is thus 


glx) = NA fx(£) = E a ex (- (d —t)r), x»50,t«B, 


which is the density of a Ga(a, 8 — t) distribution. 

b) Yes. For t < B, the mean of the (exponentially tilted) Y ~ Ga(o, 8 — t) is E(Y) = o/(B — t) 
so as t + £—, the mean can be arbitrarily large, and, as t + —oo the mean can be arbitrarily 
small (but positive). 


Solution 11.19 (Importance sampling with exponential tilting for Poisson) Ex. p. 91 


a) Let fx(k) = Bre n, k € No, denote the probability mass function of X. The moment- 
generating function of X is 


C iu = aS (ue 
Mx(t) = E(exp(tX)) = Y ett Kremt = ew D ETE 
k=0 i k=0 


= exp(—u(1 — e), tER. 


Exponentially tilting (the probability mass function of) X leads to a random variable Y 
with probability mass function 


eh) 
POY = F) = QU = ) = a) = PEP req) = AP exp(—ne!), ke No, 


which is the probability mass function of a Poi(je*) distribution, so under Q;, X is Poi(jie*) 
distributed. 


k 
b) Let fx,(k) = Pie, k € No, denote the probability mass function of X;, i € (1,...,m]. 
Since X; ^? Poi(j;), i= 1,..., m, their joint probability mass function is given by 


fx(k) = P(X; = ky... Xm = km) = [| É k = (ki,...,km) € NP. 
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Furthermore, the moment-generating function of L = 372, e; X; is 


m m Tri 
My) = Ec") = JJ Ble) = TT ts (e) = TT eot - e). 
i=1 i=1 A 
Exponentially tilting (the joint probability mass function of) X = (X1,..., Xm) leads to a 
random vector Y = (Y;,..., Ym) with joint probability mass function 
P(Yi = kisia = Em) = Q(Xı = ky,...,Xm => km) e irem) 
_ exp(t MA eiki) fx(ky = I, ecikit m us E 
Mz(t) [Ere Ore) iu d 
- II ae)” e nien 
ki! : 


which is the joint probability mass function of independent Poi(j;e%") distributions, so 
under Qj, X = (Xj,..., Xm) has a distribution with independence copula and Poi(j;e°*), 
i € (1,..., m}, marginal distributions. 

c) We need to find the value of t such that, under Q;, L has expectation c, so we need to find 
t such that 


m m 
ce = E® (L) = S e;E® (X;) S enc, 
i-l i=l 


where we used from b) that X; ~ Poi(pje%’) under Q; and thus E®(X;) = ue, i € 


Lx Wy 


Advanced 


Solution 11.20 (Importance sampling for CreditRisk") 
In CreditRisk*, default counts Y; are conditionally Poisson, see MFE (2015, Section 11.2.5): 


Y;| (V = v) ~ Poi(k;wjb). 


For simplicity we consider the one-factor case Y; | (V = v) ~ Poi(k;w;v). The goal is to estimate 
tail probabilities for the portfolio loss L = 3774 ej0;Y;; see MFE (2015, Section 11.2.5). 
We construct the ‘inner’ (so conditional on V = y) importance sampling estimator Bim (y) 
of O(W) = P(L > c| Y = v) based on a sample size ni; with the following steps: 
1) We change the measure to Q;, under which, conditional on V = 7, default counts Y 
are conditionally independent Poisson variables with mean pj = k;wjwve^!. Similar to 
Exercise 11.19 c), to get the optimal value for t, we solve 


c= EV (r|w- wv) = Y ej;EV* (Y, |w = y) = V eiókiwive". 
24 = 
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2) Generate defaults under Q;, that is generate Y, pia dus "S' Poi(jij) and compute LE) = 
Yoc eð Y for k = loa 


3) Compute the importance sampling estimator 


in 1 a k M(t; v) 
08 (yp) = = meee ns c) pt LO 


M,(t;~) =E (exp (desit) | y- v) = 2( [l exte) y- v) 


= [[E(exp(tei Y?) | V = v) = [[ Mg s (te:ð:) -]ew —u;(1 — etiàit)) 


1 i=1 


i 3 
3 


— II exp( — kjwiv(1 = eridit)). 


the form of the moment-generating function of a Poisson random variable was derived in 
Exercise 11.19 a). 


To construct the ‘outer’ (unconditional) importance estimator pu of 0 = P(L > c) based 


on a sample size Nout, consider the exponential tilting density gs(x) = en) fo (x), where fy is 


the density of V ~ Ga(a, 8) and My(s) = (8/(8 — s))*, s < B; see Exercise 11.18 a). We want 
to choose s such that the mean of V under gs(x) represents a ‘bad’ factor outcome in the sense 
of a state of the economy in which there are many defaults. T'he following steps construct pot. : 


1) Generate V4,..., V4... ^v Ga(a, B — s) (the outer importance sampling proposal distribu- 
tion). 
2) Compute 0 (y), k — 1,..., nou. 


3) Compute the importance sampling estimator 


zu 1 Nout 3t Mwy(s Nout B a 
0p 0? (Uk = in ( vw ) exp(—sW;,). 
oa Nout 2. a ame Nout > a p= $ p ) 


Note. Note that in a one-factor model, we could have equally well subsumed the weight w; in 
k; and write k; instead of k;w;. 


Solution 11.21 (Probability mass function of number of defaults under exchangeability) Ex. p. 91 
Let k € {1,...,m}. For any distinct i1,...,i, € {1,...,m}, the inclusion-exclusion principle 
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implies that 


("c *)s- ULT ("2 na - cono ine] 


WIR ! (m — k)! 
j; m m — k)! l 
TD ma e S 


Note. The point of this exercise is to realize that for k € {1,...,m}, the two probabilities 
P(M = k) and mk = P(Y = 1,..., Yi, = 1) (for any distinct i1,..., ip € {1,...,m}) are 
different. 


Solution 11.22 (Threshold model with Archimedean survival copula) 


a) 


After transformation with the marginal distribution functions, the model (X, d) is equivalent 
to a model (U, p) where U is distributed according to an Archimedean survival copula 
and p = (pi,...,pm) are the unconditional default probabilities. The random vector 
U = 1 — U thus has an Archimedean copula. We know by MFE (2015, equation (11.25) 
and Proposition 7.51 (i) and (iii)) that the components of U are conditionally independent 
given a factor V ~ G with conditional distribution function given by P(U; < u| Y = v) = 
exp(—vG-(u)). It follows that 


pil) = P(U; < pj| Y = v) = P(U; > 1— p; | V = v) 
EX exp( —»G- ! (1 — pi)) iE{1,... m}. 


When V ~ Ga(a,a), a > 0, then G(t) = (1 + t/a)~® is the Laplace transform and its 
inverse is G-!(u) = a(u-!/* — 1). We obtain from a) that 


pU) 21—exp(-vo((1— p) V^ —1), te {1,...,m}. 


Consider a Taylor series expansion of f(x) = (1 — x)-!/^ — 1 about 0. Since f(0) = 0, 
f'(x) = (1 — z)-1/o-1/o and thus f'(0) = 1/a, we have the first-order Taylor expansion 
f(x) = f(0) + f'(0)(x — 0) = x/a for x near 0 and thus k; = af (pi) ~ ap;/a = pi for small 
Pi, iE {1,... M}. 
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d) By b), the threshold model is equivalent to a Bernoulli mixture model with conditional 
default probabilities 1 — exp(—wk;). This is also the Bernoulli mixture model that is implied 
by a one-factor CreditRisk* model where defaults are conditionally Poisson given V = wv 


with default rate kj; see MFE (2015, Section 11.2.5). 
e) The joint default probability P(U; < p, U; € p) of obligors i Æ j is given by 


P(U; X p,Uj € p) = E(P(U; < p, Uj < p| V)) = E(P(U; X p| V)P(U; < p| V)) 
= E(p(V)p(V)) = E(p(V)?), 
where p(w) = 1 — exp(—vk) for k = a((1 — p)-!/* — 1). Therefore, 
E(p(v)?) = z((1 — exp(—Wk))”) = 1 — 2E(exp(—Wk)) + E(exp(—2Wk)). 


With E(exp(—Wt)) = G(t) = (1+t/a)~®, t > 0, we obtain that the joint default probability 


is given by 


E(p(V)?) = 1— 2(1-- k/a)-? + (1 + 2k/a)~* = 1 —2(1 — p) + (2(1 — p) /^ — 1). 


By l'Hópital's rule (case *oo/oo"), (2(1 — p)~!/* — 1)-* > (1 — p) for a — 0+ and thus 
lima.,9,: E(p(W)?) = 1— 2(1— p) + 1— p = p as in the case of comonotone defaults. 
By l'Hópital's rule (case ‘0/0’), (2(1 — p)- V — 1)-* > (1 — p)? for a — oo and thus 
limas% E(p(W)?) = 1 — 2(1 — p) + (1 — p)? = p? as in the case of independent defaults. 


Therefore, small o increase the default dependence in the model. 


Solution 11.23 (Copula of exchangeable CreditRiskt model) 
Let 7; be defined by 


fty = P(Ya — 0,..., Yi, = 0) = E(P(Y — 0, Yi, =0|WV)) = E(P(Y;, x 0, Yi, € 0| W)) 
k E k (cio 
-E(I[eG, «019)) - (TI S epet) ) = Eexp(- c9) 
j=l jel 


= (1+ kc/a) ? ju SC exp(-(a + kc)v) dw 
—(1-ckc/a) ^, ke€1L...,m]. 


It follows that 4 = 41 = (1 + c/o)? so that c = a(&-!/* — 1) and thus 


fy = (1+ k(o(& ^ — 1))/a) * = (ka — k +1)? = CELA... 8), ket... 


The claim now follows from the hint with the parameter 0 chosen as 1/a. 


m]. 
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